WO 2005/061718



PCT/GB2004/005462 



GENE EXPRESSION TECHNIQUE 
FIELD OF THE INVENTION 

The present application relates to gene expression techniques.
BACKGROUND OF THE INVENTION 

The class of proteins known as chaperones has been defined by Hartl (1996,
Nature, 381, 571-580) as a protein that binds to and stabilises an otherwise
unstable conformer of another protein and, by controlled binding and release,
facilitates its correct fate in vivo, be it folding, oligomeric assembly,
transport to a particular subcellular compartment, or disposal by degradation.

BiP (also known as GRP78, Ig heavy chain binding protein and Kar2p in yeast) is
an abundant ~70kDa chaperone of the hsp 70 family, resident in the endoplasmic
reticulum (ER), which amongst other functions, serves to assist in transport in
the secretory system and fold proteins.

Protein disulphide isomerase (PDI) is a chaperone protein, resident in the ER,
that is involved in the catalysis of disulphide bond formation during the
post-translational processing of proteins.

Studies of the secretion of both native and foreign proteins have shown that
transit from the ER to the Golgi is the rate-limiting step. Evidence points to a
transient association of the BiP with normal proteins and a more stable
interaction with mutant or misfolded forms of a protein. As a result, BiP may
play a dual role in solubilising folding precursors and preventing the transport
of unfolded and unassembled proteins. Robinson and Wittrup, 1995, Biotechnol
Prog, 11, 171-177, have examined the effect of foreign protein secretion on BiP
(Kar2p) and PDI protein levels in Saccharomyces cerevisiae and found that
prolonged constitutive expression of foreign secreted proteins reduces soluble
BiP and PDI to levels undetectable by Western analysis. The lowering of ER
chaperone and foldase levels as a consequence of heterologous protein secretion
has important implications for attempts to improve yeast expression/secretion
systems.

Expression of chaperones is regulated by a number of mechanisms, including the
unfolded protein response (UPR).

Using recombinant techniques, multiple PDI gene copies have been shown to
increase PDI protein levels in a host cell (Farquhar et al, 1991, Gene, 108,
81-89).

Co-expression of the gene encoding PDI and a gene encoding a heterologous
disulphide-bonded protein was first suggested in WO 93/25676, published on 23
December 1993, as a means of increasing the production of the heterologous
protein. WO 93/25676 reports that the recombinant expression of antistasin and
tick anticoagulant protein can be increased by co-expression with PDI.

This strategy has been exploited to increase the recombinant expression of other 
types of protein. 

Robinson et al, 1994, Bio/Technology, 12, 381-384 reported that a recombinant
additional PDI gene copy in Saccharomyces cerevisiae could be used to increase
the recombinant expression of human platelet derived growth factor (PDGF) B
homodimer by ten-fold and Schizosaccharomyces pombe acid phosphatase by
four-fold.

Hayano et al, 1995, FEBS Letters, 377, 505-511 described the co-expression of
human lysozyme and PDI in yeast. Increases of around 30-60% in functional
lysozyme production and secretion were observed.

Shusta et al, 1998, Nature Biotechnology, 16, 773-777 reported that the
recombinant expression of single-chain antibody fragments (scFv) in
Saccharomyces cerevisiae could be increased by between 2-8 fold by
over-expressing PDI in the host cell.

Bao & Fukuhara, 2001, Gene, 272, 103-110 reported that the expression and
secretion of recombinant human serum albumin (rHSA) in the yeast
Kluyveromyces lactis could be increased by 15-fold or more by co-expression with
an additional recombinant copy of the yeast PDI gene (KlPDI1).

In order to produce co-transformed yeast comprising both a PDI gene and a gene
for a heterologous protein, WO 93/25676 taught that the two genes could be
chromosomally integrated; one could be chromosomally integrated and one
present on a plasmid; each gene could be introduced on a different plasmid; or
both genes could be introduced on the same plasmid. WO 93/25676 exemplified
expression of antistasin from the plasmid pKH4α2 in yeast strains having a
chromosomally integrated additional copy of a PDI gene (Examples 16 and 17);
expression of antistasin from the vector K991 with an additional PDI gene copy
being present on a multicopy yeast shuttle vector named YEp24 (Botstein et al,
1979, Gene, 8, 17-24) (Example 20); and expression of both the antistasin and the
PDI genes from the yeast shuttle vector pC1/1 (Rosenberg et al, 1984, Nature,
312, 77-80) under control of the GAL10 and GAL1 promoters, respectively.
Indeed, Robinson and Wittrup, 1995, op. cit., also used the GAL1-GAL10
intergenic region to express erythropoietin and concluded that production yeast
strains for the secretion of heterologous proteins should be constructed using
tightly repressible, inducible promoters, otherwise the negative effects of
sustained secretion (i.e. lowered detectable BiP and PDI) would be dominant after
the many generations of cell growth required to fill a large-scale fermenter.

Subsequent work in the field has identified chromosomal integration of transgenes
as the key to maximising recombinant protein production.

Robinson et al, 1994, op. cit., obtained the observed increases in expression of
PDGF and S. pombe acid phosphatase using an additional chromosomally
integrated PDI gene copy. Robinson et al reported that attempts to use the
multi-copy 2µm expression vector to increase PDI protein levels had had a
detrimental effect on heterologous protein secretion.

Hayano et al, 1995, op. cit., described the introduction of genes for human
lysozyme and PDI into a yeast host each on a separate linearised integration
vector, thereby to bring about chromosomal integration.

Shusta et al, 1998, op. cit., reported that in yeast systems, the choice between
integration of a transgene into the host chromosome versus the use of episomal
expression vectors can greatly affect secretion and, with reference to Parekh &
Wittrup, 1997, Biotechnol Prog., 13, 117-122, that stable integration of the scFv
gene into the host chromosome using a δ integration vector was superior to the
use of a 2µm-based expression plasmid. Parekh & Wittrup, op. cit., had previously
taught that the expression of bovine pancreatic trypsin inhibitor (BPTI) was
increased by an order of magnitude using a δ integration vector rather than a
2µm-based expression plasmid. The 2µm-based expression plasmid was said to be
counter-productive for the production of heterologous secreted protein.

Bao & Fukuhara, 2001, op. cit., reported that "It was first thought that the
KlPDI1 gene might be directly introduced into the multi-copy vector that carried
the rHSA expression cassette. However, such constructs were found to severely
affect yeast growth and plasmid stability. This confirmed our previous finding
that the KlPDI1 gene on a multi-copy vector was detrimental to growth of
K. lactis cells (Bao et al, 2000)". Bao et al, 2000, Yeast, 16, 329-341, as
referred to in the above-quoted passage of Bao & Fukuhara, reported that the
KlPDI1 gene had been introduced into K. lactis on a multi-copy plasmid, pKan707,
and that the presence of the plasmid caused the strain to grow poorly. Bao et al
concluded that over-expression of the KlPDI1 gene was toxic to K. lactis cells.
In the light of the earlier findings in Bao et al, Bao & Fukuhara chose to
introduce a single duplication of KlPDI1 on the host chromosome.

Against this background, we have surprisingly demonstrated that, contrary to the
suggestions in the prior art, when the genes for a chaperone protein and a
heterologous protein are co-expressed on a 2µm-family multi-copy plasmid in
yeast, the production of the heterologous protein is substantially increased.

DESCRIPTION OF THE INVENTION 

A first aspect of the present invention provides a method for producing
heterologous protein comprising:

(a) providing a host cell comprising a 2µm-family plasmid, the
plasmid comprising a gene encoding a protein comprising the sequence of
a chaperone protein and a gene encoding a heterologous protein;

(b) culturing the host cell in a culture medium under conditions that 
allow the expression of the gene encoding the chaperone protein and the 
gene encoding a heterologous protein; 

(c) purifying the thus expressed heterologous protein from the culture 
medium; and 

(d) optionally, lyophilising the thus purified protein. 

In one embodiment, step (c) purifies the thus expressed heterologous protein to a
commercially acceptable level of purity or a pharmaceutically acceptable level of
purity.

Preferably, the method further comprises the step of formulating the purified
heterologous protein with a carrier or diluent, such as a pharmaceutically
acceptable carrier or diluent and optionally presenting the thus formulated
protein in a unit dosage form.

A second aspect of the present invention provides for the use of a 2µm-family
plasmid as an expression vector to increase the production of a fungal (preferably
yeast) or vertebrate heterologous protein by providing a gene encoding the
heterologous protein and a gene encoding a protein comprising the sequence of a
chaperone protein on the same 2µm-family plasmid.

A third aspect of the present invention provides a 2µm-family plasmid comprising
a gene encoding a protein comprising the sequence of a chaperone protein and a
gene encoding a heterologous protein, wherein if the plasmid is based on the 2µm
plasmid then it is a disintegration vector.

A fourth aspect of the invention provides a host cell comprising a plasmid as 
defined above. 

The present invention relates to recombinantly modified versions of 2µm-family
plasmids.

Certain closely related species of budding yeast have been shown to contain
naturally occurring circular double stranded DNA plasmids. These plasmids,
collectively termed 2µm-family plasmids, include pSR1, pSB3 and pSB4 from
Zygosaccharomyces rouxii (formerly classified as Zygosaccharomyces bisporus),
plasmids pSB1 and pSB2 from Zygosaccharomyces bailii, plasmid pSM1 from
Zygosaccharomyces fermentati, plasmid pKD1 from Kluyveromyces
drosophilarum, an un-named plasmid from Pichia membranaefaciens (hereinafter
"pPM1") and the 2µm plasmid and variants (such as Scp1, Scp2 and Scp3) from
Saccharomyces cerevisiae (Volkert et al., 1989, Microbiological Reviews, 53,
299; Murray et al, 1988, J. Mol. Biol., 200, 601; Painting et al, 1984, J. Applied
Bacteriology, 56, 331).

As a family of plasmids these molecules share a series of common features in that
they typically possess two inverted repeats on opposite sides of the plasmid, have
a similar size around 6-kbp (range 4757 to 6615-bp), three open reading frames,
one of which encodes for a site specific recombinase (FLP) and an autonomously
replicating sequence (ARS), also known as an origin of replication (ori), located
close to the end of one of the inverted repeats (Futcher, 1988, Yeast, 4, 27;
Murray et al, op. cit., and Toh-e et al., 1986, Basic Life Sci., 40, 425). Despite
their lack of discernible DNA sequence homology, their shared molecular
architecture and the conservation of function of the three open reading frames
have demonstrated a common ancestral link between the family members.



Whilst any of the above naturally occurring 2µm-family plasmids can be used in
the present invention, this invention is not limited to the use of naturally
occurring 2µm-family plasmids. For the purposes of this invention, a 2µm-family
plasmid is as described below.

A 2µm-family plasmid is a circular, double stranded, DNA plasmid. It is typically
small, such as between 3,000 to 10,000 bp, preferably between 4,500 to 7,000 bp,
excluding recombinantly inserted sequences.

A 2µm-family plasmid typically comprises at least three open reading frames
("ORFs") that each encodes a protein that functions in the stable maintenance of
the 2µm-family plasmid as a multicopy plasmid. The proteins encoded by the
three ORFs can be designated FLP, REP1 and REP2. Where a 2µm-family
plasmid comprises not all three of the ORFs encoding FLP, REP1 and REP2 then
ORFs encoding the missing protein(s) should be supplied in trans, either on
another plasmid or by chromosomal integration.

A "FLP" protein is a protein capable of catalysing the site-specific recombination
between inverted repeat sequences recognised by FLP. The inverted repeat
sequences are termed FLP recombination target (FRT) sites and each is typically
present as part of a larger inverted repeat (see below). Preferred FLP proteins
comprise the sequence of the FLP proteins encoded by one of plasmids pSR1,
pSB1, pSB2, pSB3, pSB4, pSM1, pKD1, pPM1 and the 2µm plasmid, for
example as described in Volkert et al, op. cit., Murray et al, op. cit., and
Painting et al, op. cit. Variants and fragments of these FLP proteins are also
included in the present invention. "Fragments" and "variants" are those which
retain the ability of the native protein to catalyse the site-specific
recombination between the same FRT sequences. Such variants and fragments will
usually have at least 50%, 60%, 70%, 80%, 90%, 95%, 98%, 99%, or more, homology
with an FLP protein encoded by one of plasmids pSR1, pSB1, pSB2, pSB3, pSB4,
pSM1, pKD1, pPM1 and the 2µm plasmid. Different FLP proteins can have different
FRT sequence specificities. A typical FRT site may comprise a core nucleotide
sequence flanked by inverted repeat sequences. In the 2µm plasmid, the FRT core
sequence is 8 nucleotides in length and the flanking inverted repeat sequences
are 13 nucleotides in length (Volkert et al, op. cit.). However the FRT site
recognised by any given FLP protein may be different to the 2µm plasmid FRT site.

REP1 and REP2 are proteins involved in the partitioning of plasmid copies during
cell division, and may also have a role in the regulation of FLP expression.
Considerable sequence divergence has been observed between REP1 proteins
from different 2µm-family plasmids, whereas no sequence alignment is possible
between REP2 proteins derived from different 2µm-family plasmids. Preferred
REP1 and REP2 proteins comprise the sequence of the REP1 and REP2 proteins
encoded by one of plasmids pSR1, pSB1, pSB2, pSB3, pSB4, pSM1, pKD1,
pPM1 and the 2µm plasmid, for example as described in Volkert et al, op. cit.,
Murray et al, op. cit., and Painting et al, op. cit. Variants and fragments of
these REP1 and REP2 proteins are also included in the present invention.
"Fragments" and "variants" of REP1 and REP2 are those which, when encoded by the
plasmid in place of the native ORF, do not substantially disrupt the stable
multicopy maintenance of the plasmid within a suitable yeast population. Such
variants and fragments of REP1 and REP2 will usually have at least 5%, 10%, 20%,
30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, 98%, 99%, or more, homology with a REP1
and REP2 protein, respectively, as encoded by one of plasmids pSR1, pSB1, pSB2,
pSB3, pSB4, pSM1, pKD1, pPM1 and the 2µm plasmid.

The REP1 and REP2 proteins encoded by the ORFs on the plasmid must be
compatible. It is preferred that the REP1 and REP2 proteins have the sequences
of REP1 and REP2 proteins encoded by the same naturally occurring 2µm-family
plasmid, such as pSR1, pSB1, pSB2, pSB3, pSB4, pSM1, pKD1, pPM1 and the
2µm plasmid, or variants or fragments thereof.

A 2µm-family plasmid typically comprises two inverted repeat sequences. The
inverted repeats may be any size, so long as they each contain an FRT site (see
above). The inverted repeats are typically highly homologous. They may share
greater than 50%, 60%, 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99%, 99.5% or
more sequence identity. In a preferred embodiment they are identical. Typically
the inverted repeats are each between 200 to 1000 bp in length. Preferred inverted
repeat sequences may each have a length of from 200 to 300 bp, 300 to 400 bp,
400 to 500 bp, 500 to 600 bp, 600 to 700 bp, 700 to 800 bp, 800 to 900 bp, or 900
to 1000 bp. Particularly preferred inverted repeats are those of the plasmids pSR1
(959 bp), pSB1 (675 bp), pSB2 (477 bp), pSB3 (391 bp), pSM1 (352 bp), pKD1
(346 bp), the 2µm plasmid (599 bp), pSB4 or pPM1.
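
By way of illustration only, the degree of sequence identity between two inverted repeats could be estimated as sketched below. This is a minimal sketch and not a method taken from the specification: it assumes the two repeats are of equal length and already aligned without gaps, and the function names and the short example sequences are hypothetical. Because the second repeat lies in inverted orientation, it is reverse-complemented before comparison.

```python
# Hypothetical helper names; assumes ungapped, equal-length repeats.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an upper- or lower-case DNA string."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percentage of identical positions between two pre-aligned sequences."""
    if not seq_a or len(seq_a) != len(seq_b):
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(a == b for a, b in zip(seq_a.upper(), seq_b.upper()))
    return 100.0 * matches / len(seq_a)


def inverted_repeat_identity(ir1: str, ir2_as_read: str) -> float:
    """Compare IR1 with IR2, where IR2 is read on the same strand as IR1
    and therefore appears as the reverse complement of IR1."""
    return percent_identity(ir1, reverse_complement(ir2_as_read))


# Illustrative 10-nt sequences only; real inverted repeats are 200 to 1000 bp.
ir1 = "GAAGTTCCTA"
ir2 = "AAGGAACTTC"  # reverse complement of a copy of ir1 with one mismatch
print(inverted_repeat_identity(ir1, ir2))
```

In practice a proper pairwise alignment would be needed wherever the repeats differ by insertions or deletions; the simple position-by-position count above does not handle that case.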

The sequences of the inverted repeats may be varied. However, the sequences of
the FRT site in each inverted repeat should be compatible with the specificity of
the FLP protein encoded by the plasmid, thereby to enable the encoded FLP
protein to act to catalyse the site-specific recombination between the inverted
repeat sequences of the plasmid. Recombination between inverted repeat
sequences (and thus the ability of the FLP protein to recognise the FRT sites
within the plasmid) can be determined by methods known in the art. For example, a
plasmid in a yeast cell under conditions that favour FLP expression can be assayed
for changes in the restriction profile of the plasmid which would result from a
change in the orientation of a region of the plasmid relative to another region of
the plasmid. The detection of changes in restriction profile indicates that the
FLP protein is able to recognise the FRT sites in the plasmid and therefore that
the FRT sites in each inverted repeat are compatible with the specificity of the
FLP protein encoded by the plasmid.

In a particularly preferred embodiment, the sequences of inverted repeats,
including the FRT sites, are derived from the same 2µm-family plasmid as the
ORF encoding the FLP protein, such as pSR1, pSB1, pSB2, pSB3, pSB4, pSM1,
pKD1, pPM1 or the 2µm plasmid.

The inverted repeats are typically positioned within the 2µm-family plasmid such
that the two regions defined between the inverted repeats (e.g. such as defined as
UL and US in the 2µm plasmid) are of approximately similar size, excluding
exogenously introduced sequences such as transgenes. For example, one of the
two regions may have a length equivalent to at least 40%, 50%, 60%, 70%, 80%,
90%, 95% or more, up to 100%, of the length of the other region.
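
The size relationship described above can be expressed as a simple ratio. The following is a minimal sketch under stated assumptions: the function names are hypothetical, the region lengths are taken to exclude exogenously introduced sequences, and the example lengths are invented for illustration rather than taken from any particular plasmid.

```python
def region_length_percent(region_a_bp: int, region_b_bp: int) -> float:
    """Length of the shorter region as a percentage of the longer region."""
    if region_a_bp <= 0 or region_b_bp <= 0:
        raise ValueError("region lengths must be positive")
    shorter, longer = sorted((region_a_bp, region_b_bp))
    return 100.0 * shorter / longer


def regions_similar(region_a_bp: int, region_b_bp: int,
                    minimum_percent: float = 40.0) -> bool:
    """True if the shorter region is at least minimum_percent of the longer."""
    return region_length_percent(region_a_bp, region_b_bp) >= minimum_percent


# Hypothetical region lengths in bp, for illustration only.
print(region_length_percent(2000, 2500))   # shorter region relative to longer
print(regions_similar(1000, 3000, 40.0))   # below the 40% threshold
```

The default threshold of 40% corresponds to the lowest figure recited above; a stricter embodiment would simply pass a higher value.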

A 2µm-family plasmid typically comprises the ORF that encodes FLP and one
inverted repeat (arbitrarily termed "IR1" to distinguish it from the other inverted
repeat mentioned in the next paragraph) juxtaposed in such a manner that IR1
occurs at the distal end of the FLP ORF, without any intervening coding sequence,
for example as seen in the 2µm plasmid. By "distal end" in this context we mean
the end of the FLP ORF opposite to the end from which the promoter initiates its
transcription. In a preferred embodiment, the distal end of the FLP ORF overlaps
with IR1.

A 2µm-family plasmid typically comprises the ORF that encodes REP2 and the
other inverted repeat (arbitrarily termed "IR2" to distinguish it from IR1
mentioned in the previous paragraph) juxtaposed in such a manner that IR2 occurs
at the distal end of the REP2 ORF, without any intervening coding sequence, for
example as seen in the 2µm plasmid. By "distal end" in this context we mean the
end of the REP2 ORF opposite to the end from which the promoter initiates its
transcription.

In one embodiment, the ORFs encoding REP2 and FLP may be present on the
same region of the two regions defined between the inverted repeats of the
2µm-family plasmid, which region may be the bigger or smaller of the regions (if
there is any inequality in size between the two regions).

In one embodiment, the ORFs encoding REP2 and FLP may be transcribed from 
divergent promoters. 

Typically, the regions defined between the inverted repeats (e.g. such as defined
as UL and US in the 2µm plasmid) of a 2µm-family plasmid may comprise not
more than two endogenous genes that encode a protein that functions in the stable
maintenance of the 2µm-family plasmid as a multicopy plasmid. Thus in a
preferred embodiment, one region of the plasmid defined between the inverted
repeats may comprise not more than the ORFs encoding FLP and REP2; FLP and
REP1; or REP1 and REP2, as endogenous coding sequence.

A 2µm-family plasmid typically comprises an origin of replication (also known as
an "autonomously replicating sequence" or "ARS"), which is typically
bidirectional. Any appropriate ARS sequence can be present. Consensus sequences
typical of yeast chromosomal origins of replication may be appropriate (Broach et
al, 1982, Cold Spring Harbor Symp. Quant. Biol., 47, 1165-1174; Williamson, Yeast,
1985, 1, 1-14). Preferred ARSs include those isolated from pSR1, pSB1, pSB2,
pSB3, pSB4, pSM1, pKD1, pPM1 and the 2µm plasmid.

Thus, a preferred 2µm-family plasmid may comprise ORFs encoding FLP, REP1
and REP2, two inverted repeat sequences each inverted repeat comprising an FRT
site compatible with the encoded FLP protein, and an ARS sequence. Preferably
the FRT sites are derived from the same 2µm-family plasmid as the sequence of
the encoded FLP protein. More preferably the sequences of the encoded REP1
and REP2 proteins are derived from the same 2µm-family plasmid as each other.
Even more preferably, the FRT sites are derived from the same 2µm-family
plasmid as the sequence of the encoded FLP, REP1 and REP2 proteins. Yet more
preferably, the sequences of the ORFs encoding FLP, REP1 and REP2, and the
sequence of the inverted repeats (including the FRT sites) are derived from the
same 2µm-family plasmid. Furthermore, the ARS site may be derived from the
same 2µm-family plasmid as one or more of the ORFs of FLP, REP1 and REP2,
and the sequence of the inverted repeats (including the FRT sites).

The term "derived from" includes sequences having an identical sequence to the
sequence from which they are derived. However, variants and fragments thereof,
as defined above, are also included. For example, an FLP gene having a sequence
derived from the FLP gene of the 2µm plasmid may have a modified promoter or
other regulatory sequence compared to that of the naturally occurring gene.
Additionally or alternatively, an FLP gene having a sequence derived from the
FLP gene of the 2µm plasmid may have a modified nucleotide sequence in the
open reading frame which may encode the same protein as the naturally occurring
gene, or may encode a modified FLP protein. The same considerations apply to
other sequences on a 2µm-family plasmid having a sequence derived from a
particular source.

Optionally, a 2µm-family plasmid may comprise a region derived from the STB
region (also known as REP3) of the 2µm plasmid, as defined in Volkert et al, op.
cit. The STB region in a 2µm-family plasmid of the invention may comprise two
or more tandem repeat sequences, such as three, four, five or more. Alternatively,
no tandem repeat sequences may be present. The tandem repeats may be any size,
such as 10, 20, 30, 40, 50, 60, 70, 80, 90, 100 bp or more in length. The tandem
repeats in the STB region of the 2µm plasmid are 62 bp in length. It is not
essential for the sequences of the tandem repeats to be identical. Slight sequence
variation can be tolerated. It may be preferable to select an STB region from the
same plasmid as either or both of the REP1 and REP2 ORFs. The STB region is
thought to be a cis-acting element and preferably is not transcribed.

Optionally, a 2µm-family plasmid may comprise an additional ORF that encodes a
protein that functions in the stable maintenance of the 2µm-family plasmid as a
multicopy plasmid. The additional protein can be designated RAF or D. ORFs
encoding the RAF or D gene can be seen on, for example, the 2µm plasmid and
pSM1. Thus a RAF or D ORF can comprise a sequence suitable to encode the
protein product of the RAF or D gene ORFs encoded by the 2µm plasmid or
pSM1, or variants and fragments thereof. Thus variants and fragments of the
protein products of the RAF or D genes of the 2µm plasmid or pSM1 are also
included in the present invention. "Fragments" and "variants" of the protein
products of the RAF or D genes of the 2µm plasmid or pSM1 are those which,
when encoded by the 2µm plasmid or pSM1 in place of the native ORF, do not
disrupt the stable multicopy maintenance of the plasmid within a suitable yeast
population. Such variants and fragments will usually have at least 5%, 10%, 20%,
30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, 98%, 99%, or more, homology with
the protein product of the RAF or D gene ORFs encoded by the 2µm plasmid or
pSM1.

A naturally occurring 2µm-family plasmid may be preferred. A naturally
occurring 2µm-family plasmid is any plasmid having the features defined above,
which plasmid is found to naturally exist in yeast, i.e. has not been recombinantly
modified to include heterologous sequence. Preferably the naturally occurring
2µm-family plasmid is selected from pSR1 (Accession No. X02398), pSB3
(Accession No. X02608) or pSB4 as obtained from Zygosaccharomyces rouxii,
pSB1 or pSB2 (Accession No. NC_002055 or M18274) both as obtained from
Zygosaccharomyces bailii, pSM1 (Accession No. NC_002054) as obtained from
Zygosaccharomyces fermentati, pKD1 (Accession No. X03961) as obtained from
Kluyveromyces drosophilarum, pPM1 from Pichia membranaefaciens or, most
preferably, the 2µm plasmid (Accession No. NC_001398 or J01347) as obtained
from Saccharomyces cerevisiae. Accession numbers in this paragraph refer to
NCBI deposits.

The 2µm plasmid (Figure 1) is a 6,318-bp double-stranded DNA plasmid,
endogenous in most Saccharomyces cerevisiae strains at 60-100 copies per
haploid genome. The 2µm plasmid comprises a small unique (US) region and a
large unique (UL) region, separated by two 599-bp inverted repeat sequences.
Site-specific recombination of the inverted repeat sequences results in
inter-conversion between the A-form and B-form of the plasmid in vivo (Volkert &
Broach, 1986, Cell, 46, 541). The two forms of 2µm differ only in the relative
orientation of their unique regions.

While DNA sequencing of a cloned 2µm plasmid (also known as Scp1) from
Saccharomyces cerevisiae gave a size of 6318-bp (Hartley and Donelson, 1980,
Nature, 286, 860), other slightly smaller variants of 2µm, Scp2 and Scp3, are
known to exist as a result of small deletions of 125-bp and 220-bp, respectively, in
a region known as STB (Cameron et al., 1977, Nucl. Acids Res., 4, 1429; Kikuchi,
1983, Cell, 35, 487 and Livingston & Hahne, 1979, Proc. Natl. Acad. Sci. USA,
76, 3727). In one study about 80% of natural Saccharomyces strains from around
the world contained DNA homologous to 2µm (by Southern blot analysis)
(Hollenberg, 1982, Current Topics in Microbiology and Immunobiology, 96, 119).
Furthermore, variation (genetic polymorphism) occurs within the natural
population of 2µm plasmids found in S. cerevisiae and S. carlsbergensis, with the
NCBI sequence (accession number NC_001398) being one example.
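
The sizes quoted above imply some simple arithmetic, sketched here for convenience. The sketch assumes, as a simplification, that the stated STB deletions are the only differences between Scp1 and the Scp2 and Scp3 variants, and that the unique (US plus UL) sequence is whatever remains once the two 599-bp inverted repeats are subtracted; the constant and function names are hypothetical.

```python
SCP1_LENGTH_BP = 6318          # sequenced size of the cloned 2µm plasmid (Scp1)
INVERTED_REPEAT_BP = 599       # length of each of the two inverted repeats
STB_DELETIONS_BP = {"Scp2": 125, "Scp3": 220}  # deletions relative to Scp1


def variant_lengths() -> dict:
    """Expected plasmid sizes, assuming only the stated deletions differ."""
    lengths = {"Scp1": SCP1_LENGTH_BP}
    for name, deleted in STB_DELETIONS_BP.items():
        lengths[name] = SCP1_LENGTH_BP - deleted
    return lengths


def unique_sequence_bp() -> int:
    """Combined length of the US and UL regions of Scp1."""
    return SCP1_LENGTH_BP - 2 * INVERTED_REPEAT_BP


print(variant_lengths())
print(unique_sequence_bp())
```

The split of the unique sequence between the US and UL regions is not stated in this passage, so the sketch reports only their combined length.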

The 2µm plasmid has a nuclear localisation and displays a high level of mitotic
stability (Mead et al, 1986, Molecular & General Genetics, 205, 417). The
inherent stability of the 2µm plasmid results from a plasmid-encoded copy
number amplification and partitioning mechanism, which can be compromised
during the development of chimeric vectors (Futcher & Cox, 1984, J. Bacteriol.,
157, 283; Bachmair & Ruis, 1984, Monatshefte für Chemie, 115, 1229). A yeast
strain which contains a 2µm plasmid is known as [cir⁺], while a yeast strain
which does not contain a 2µm plasmid is known as [cir⁰].



The US-region of the 2µm plasmid contains the REP2 and FLP genes, and the
UL-region contains the REP1 and D (also known as RAF) genes, the STB-locus
and the origin of replication (Broach & Hicks, 1980, Cell, 21, 501; Sutton &
Broach, 1985, Mol. Cell. Biol., 5, 2770). The Flp recombinase binds to FRT-sites
(Flp Recognition Target) within the inverted repeats to mediate site-specific
recombination, which is essential for natural plasmid amplification and control of
plasmid copy number in vivo (Senecoff et al, 1985, Proc. Natl. Acad. Sci. U.S.A.,
82, 7270; Jayaram, 1985, Proc. Natl. Acad. Sci. U.S.A., 82, 5875). The copy
number of 2µm-family plasmids can be significantly affected by changes in Flp
recombinase activity (Sleep et al, 2001, Yeast, 18, 403; Rose & Broach, 1990,
Methods Enzymol., 185, 234). The Rep1 and Rep2 proteins mediate plasmid
segregation, although their mode of action is unclear (Sengupta et al, 2001, J.
Bacteriol., 183, 2306). They also repress transcription of the FLP gene (Reynolds
et al, 1987, Mol. Cell. Biol., 7, 3566).

The FLP and REP2 genes of the 2µm plasmid are transcribed from divergent
promoters, with apparently no intervening sequence defined between them. The
FLP and REP2 transcripts both terminate at the same sequence motifs within the
inverted repeat sequences, at 24-bp and 178-bp respectively after their translation
termination codons (Sutton & Broach, 1985, Mol. Cell. Biol., 5, 2770).

In the case of FLP, the C-terminal coding sequence also lies within the inverted
repeat sequence. Furthermore, the two inverted repeat sequences are highly
conserved over 599-bp, a feature considered advantageous to efficient plasmid
replication and amplification in vivo, although only the FRT-sites (less than
65-bp) are essential for site-specific recombination in vitro (Senecoff et al,
1985, Proc. Natl. Acad. Sci. U.S.A., 82, 7270; Jayaram, 1985, Proc. Natl. Acad.
Sci. U.S.A., 82, 5875; Meyer-Leon et al, 1984, Cold Spring Harbor Symposia On
Quantitative Biology, 49, 797). The key catalytic residues of Flp are arginine-308
and tyrosine-343 (which is essential) with strand-cutting facilitated by
histidine-309 and histidine-345 (Prasad et al, 1987, Proc. Natl. Acad. Sci. U.S.A.,
84, 2189; Chen et al, 1992, Cell, 69, 647; Grainge et al, 2001, J. Mol. Biol., 314,
717).

Two functional domains are described in Rep2. Residues 15-58 form a
Rep1-binding domain, and residues 59-296 contain a self-association and
STB-binding region (Sengupta et al, 2001, J. Bacteriol., 183, 2306).

Chimeric or large deletion mutant derivatives of 2µm which lack many of the
essential functional regions of the 2µm plasmid but retain the functional cis
elements ARS and STB, cannot effectively partition between mother and daughter
cells at cell division. Such plasmids can do so if these functions are supplied in
trans, by for instance the provision of a functional 2µm plasmid within the host,
such as a [cir⁺] host.

Genes of interest have previously been inserted into the UL-region of the 2µm
plasmid. For example, see plasmid pSAC3U1 in EP 0 286 424 and the plasmid
shown in Figure 2, which includes a β-lactamase gene (for ampicillin resistance),
a LEU2 selectable marker and an oligonucleotide linker, the latter two of which
are inserted into a unique SnaBI-site within the UL-region of the 2µm-like
disintegration vector, pSAC3 (see EP 0 286 424). The E. coli DNA between the
XbaI-sites that contains the ampicillin resistance gene is lost from the plasmid
shown in Figure 2 after transformation into yeast. This is described in Chinery &
Hinchliffe, 1989, Curr. Genet., 16, 21 and EP 0 286 424, where these types of
vectors are designated "disintegration vectors". Further polynucleotide insertions
can be made in a NotI-site within a linker (Sleep et al, 1991, Biotechnology (N Y),
9, 183).

20 

Alternative insertion sites in the 2µm plasmid are known in the art, including those
described in Rose & Broach (1990, Methods Enzymol., 185, 234-279), such as
plasmids pCV19, pCV20, CVneo, which utilise an insertion at EcoRI in FLP,
plasmids pCV21, pGT41 and pYE which utilise EcoRI in D as the insertion site,
plasmid pHKB52 which utilises PstI in D as the insertion site, plasmid pJDB248
which utilises an insertion at PstI in D and EcoRI in D, plasmid pJDB219 in
which PstI in D and EcoRI in FLP are used as insertion sites, plasmid G18,
plasmid pAB18 which utilises an insertion at ClaI in FLP, plasmids pGT39 and
pA3, plasmids pYT11, pYT14 and pYT11-LEU which use PstI in D as the
insertion site, and plasmid PTY39 which uses EcoRI in FLP as the insertion site.
Other 2µm plasmids, including pSAC3, pSAC3U1, pSAC3U2, pSAC300, pSAC310,
pSAC3C1, pSAC3PL1, pSAC3SL4, and pSAC3SC1, are described in EP 0 286
424 and Chinery & Hinchliffe (1989, Curr. Genet., 16, 21-25), which also
describe PstI, EagI or SnaBI as appropriate 2µm insertion sites. Further 2µm
plasmids include pAYE255, pAYE316, pAYE443, pAYE522 (Kerry-Williams et
al, 1998, Yeast, 14, 161-169), pDB2244 (WO 00/44772), and pAYE329 (Sleep et
al, 2001, Yeast, 18, 403-421).

In one preferred embodiment, one or more genes are inserted into a 2µm-family
plasmid within an untranscribed region around the ARS sequence. For example,
in the 2µm plasmid obtained from S. cerevisiae, the untranscribed region around
the ARS sequence extends from the end of the D gene to the beginning of the ARS
sequence. Insertion into SnaBI (near the origin of replication sequence ARS) is
described in Chinery & Hinchliffe, 1989, Curr. Genet., 16, 21-25. The skilled
person will appreciate that gene insertions can also be made in the untranscribed
region at neighbouring positions to the SnaBI site described in Chinery &
Hinchliffe.

In another preferred embodiment, REP2 and FLP genes in a 2µm-family plasmid
each have an inverted repeat adjacent to them, and one or more genes are inserted
into a 2µm-family plasmid within the region between the first base after the last
functional codon of either the REP2 gene or the FLP gene and the last base before
the FRT site in the inverted repeat adjacent to said gene. The last functional
codon of either a REP2 gene or a FLP gene is the codon in the open reading frame
of the gene that is furthest downstream from the promoter of the gene whose
replacement by a stop codon will lead to an unacceptable loss of multicopy
stability of the plasmid, as defined herein. Thus, disruption of the REP2 or FLP
genes at any point downstream of the last functional codon in either gene, by
a polynucleotide sequence insertion, deletion or substitution, will not
lead to an unacceptable loss of multicopy stability of the plasmid.

For example, the REP2 gene of the 2µm plasmid can be disrupted after codon 59
and the FLP gene of the 2µm plasmid can be disrupted after codon 344, each
without a loss of multicopy stability of the plasmid. The last functional codon in
equivalent genes in other 2µm-family plasmids can be determined routinely by
making mutants of the plasmids in either the FLP or REP2 genes and following
the tests set out herein to determine whether the plasmid retains multicopy
stability.

One can determine whether a plasmid retains multicopy stability using a test such
as that defined in Chinery & Hinchliffe (1989, Curr. Genet., 16, 21-25). For yeast that
do not grow in the non-selective media (YPD, also designated YEPD) defined in
Chinery & Hinchliffe (1989, Curr. Genet., 16, 21-25) other appropriate non-
selective media might be used. Plasmid stability may be defined as the percentage
of cells remaining prototrophic for the selectable marker after a defined number of
generations. The number of generations will preferably be sufficient to show a
difference from a control plasmid, such as pSAC35 or pSAC310, or to show
comparable stability to such a control plasmid. The number of generations may
be 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 25, 30, 35, 40,
45, 50, 60, 70, 80, 90, 100 or more. Higher numbers are preferred. The
acceptable plasmid stability might be 1%, 2%, 3%, 4%, 5%, 10%, 15%, 20%,
25%, 30%, 40%, 50%, 60%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%,
99%, 99.9% or substantially 100%. Higher percentages are preferred. The skilled
person will appreciate that, even though a plasmid may have a stability less than
100% when grown on non-selective media, that plasmid can still be of use when
cultured in selective media. For example, plasmid pDB2711 as described in the
examples is only 10% stable when the stability is determined according to the test of
Example 1, but provides a 15-fold increase in recombinant transferrin productivity
in shake flask culture under selective growth conditions.
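
The stability measure defined above is a simple proportion. The following is a minimal sketch in Python, assuming that cells sampled after the chosen number of generations have already been scored as prototrophic or not for the selectable marker. The function names and the counts are illustrative assumptions and are not taken from Chinery & Hinchliffe or from the examples.

```python
def plasmid_stability(prototrophic_cells: int, total_cells: int) -> float:
    """Percentage of scored cells still prototrophic for the selectable marker."""
    if total_cells <= 0:
        raise ValueError("total_cells must be positive")
    if not 0 <= prototrophic_cells <= total_cells:
        raise ValueError("prototrophic_cells must lie between 0 and total_cells")
    return 100.0 * prototrophic_cells / total_cells


def is_acceptable(stability_percent: float, acceptable_percent: float) -> bool:
    """Compare a measured stability with a chosen acceptance threshold."""
    return stability_percent >= acceptable_percent


# Hypothetical counts: 10 of 100 scored cells retain the marker.
stability = plasmid_stability(10, 100)
print(f"stability: {stability:.1f}%")
print("acceptable at 5%:", is_acceptable(stability, 5.0))
print("acceptable at 50%:", is_acceptable(stability, 50.0))
```

The threshold is kept as a separate argument because the text lists a wide range of acceptable stability values rather than a single cut-off.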

Thus one or more gene insertions may occur between the first base after the last
functional codon of the REP2 gene and the last base before the FRT site in an
inverted repeat adjacent to said gene, more preferably between the first base of the
inverted repeat and the last base before the FRT site, even more preferably at a
position after the translation termination codon of the REP2 gene and before the
last base before the FRT site.

Additionally or alternatively one or more gene insertions may occur between the
first base after the last functional codon of the FLP gene and the last base before
the FRT site in an inverted repeat adjacent to said gene, preferably between the
first base of the inverted repeat and the last base before the FRT site, more
preferably between the first base after the end of the FLP coding sequence and the
last base before the FRT site, such as at the first base after the end of the FLP
coding sequence.

In one preferred embodiment, where the 2µm-family plasmid is based on the 2µm
plasmid of S. cerevisiae, it is a disintegration vector as known in the art (for
example, see EP 286 424, the contents of which are incorporated herein by
reference). A disintegration vector may be a 2µm plasmid vector comprising a
DNA sequence which is intended to be lost by recombination, three 2µm FRT
sites, of which one pair of sites is in direct orientation and the other two pairs are
in indirect orientation, and a DNA sequence of interest (such as an E. coli origin
of replication and bacterial selectable marker), the said sequence to be lost being
located between the said sites which are in direct orientation.

Thus, the sequence to be lost may comprise a selectable marker DNA sequence. 

A preferred disintegration vector comprises a complete 2µm plasmid additionally
carrying (i) a bacterial plasmid DNA sequence necessary for propagation of the
vector in a bacterial host; (ii) an extra 2µm FRT site; and (iii) a selectable marker
DNA sequence for yeast transformation; the said bacterial plasmid DNA sequence
being present and the extra FRT site being created at a restriction site, such as
XbaI, in one of the two inverted repeat sequences of the 2µm plasmid, the said
extra FRT site being in direct orientation in relation to the endogenous FRT site of
the said one repeat sequence, and the bacterial plasmid DNA sequence being
sandwiched between the extra FRT site and the endogenous FRT site of the said
one repeat sequence. In a preferred disintegration vector, all bacterial plasmid
DNA sequences are sandwiched as said. A particularly preferred 2µm plasmid
vector has substantially the configuration of pSAC3 as shown in EP 286 424.

The term "disintegration vector" as used herein also includes plasmids as defined
in US 6,451,559, the contents of which are incorporated herein by reference. Thus
a disintegration vector may be a 2µm vector that, other than DNA sequence
encoding non-yeast polypeptides, contains no bacterial (particularly E. coli) origin
of replication, or more preferably no bacterial (particularly E. coli) sequence and
preferably all DNA in said vector, other than DNA sequence encoding non-yeast
polypeptides, is yeast-derived DNA.

The term "chaperone" as used herein refers to a protein that binds to and stabilises
an otherwise unstable conformer of another protein, and by controlled binding and
release, facilitates its correct fate in vivo, be it folding, oligomeric assembly,
transport to a particular subcellular compartment, or disposal by degradation.
Accordingly a chaperone is also a protein that is involved in protein folding, or
which has chaperone activity or is involved in the unfolded protein response.
Chaperone proteins of this type are known in the art, for example in the Stanford
Genome Database (SGD), http://db.yeastgenome.org. Preferred chaperones are
eukaryotic chaperones, especially preferred chaperones are yeast chaperones,
including AHA1, CCT2, CCT3, CCT4, CCT5, CCT6, CCT7, CCT8, CNS1, CPR3,
CPR6, ERO1, EUG1, FMO1, HCH1, HSP10, HSP12, HSP104, HSP26, HSP30,
HSP42, HSP60, HSP78, HSP82, JEM1, MDJ1, MDJ2, MPD1, MPD2, PDI1,
PFD1, ABC1, APJ1, ATP11, ATP12, BTT1, CDC37, CPR7, HSC82, KAR2, LHS1,
MGE1, MRS11, NOB1, ECM10, SSA1, SSA2, SSA3, SSA4, SSC1, SSE2, SIL1,
SLS1, ORM1, ORM2, PER1, PTC2, PSE1, UBI4 and HAC1 or a truncated
intronless HAC1 (Valkonen et al, 2003, Applied Environ. Micro., 69, 2065).

A chaperone useful in the practice of the present invention may be: 

• a heat shock protein, such as a protein that is a member of the hsp70
family of proteins (including Kar2p, SSA and SSB proteins, for example
proteins encoded by SSA1, SSA2, SSA3, SSA4, SSB1 and SSB2), a protein
that is a member of the HSP90-family, or a protein that is a member of
the HSP40-family or proteins involved in their modulation (e.g.
Sil1p), including DNA-J and DNA-J-like proteins (e.g. Jem1p,
Mdj2p);

• a protein that is a member of the karyopherin/importin family of proteins,
such as the alpha or beta families of karyopherin/importin proteins, for
example the karyopherin beta protein PSE1;

• a protein that is a member of the ORMDL family described by Hjelmqvist
et al, 2002, Genome Biology, 3(6), research0027.1-0027.16, such as
Orm2p;

• a protein that is naturally located in the endoplasmic reticulum or
elsewhere in the secretory pathway, such as the Golgi. For example, a
protein that naturally acts in the lumen of the endoplasmic reticulum (ER),
particularly in secretory cells, such as PDI;

• a protein that is a transmembrane protein anchored in the ER, such as a
member of the ORMDL family described by Hjelmqvist et al, 2002, supra,
(for example, Orm2p);

• a protein that acts in the cytosol, such as the hsp70 proteins, including SSA
and SSB proteins, for example proteins encoded by SSA1, SSA2, SSA3,
SSA4, SSB1 and SSB2;

• a protein that acts in the nucleus, the nuclear envelope and/or the
cytoplasm, such as Pse1p;

• a protein that is essential to the viability of the cell, such as PDI or an
essential karyopherin protein, such as Pse1p;

• a protein that is involved in sulphydryl oxidation or disulphide bond
formation, breakage or isomerization, or a protein that catalyses
thiol:disulphide interchange reactions in proteins, particularly during the
biosynthesis of secretory and cell surface proteins, such as protein
disulphide isomerases (e.g. Pdi1p, Mpd1p), homologues (e.g. Eug1p)
and/or related proteins (e.g. Mpd2p, Fmo1p, Ero1p);

• a protein that is involved in protein synthesis, assembly or folding, such as
PDI and Ssa1p;

• a protein that binds preferentially or exclusively to unfolded, rather than
mature protein, such as the hsp70 proteins, including SSA and SSB
proteins, for example proteins encoded by SSA1, SSA2, SSA3, SSA4, SSB1
and SSB2;

• a protein that prevents aggregation of precursor proteins in the cytosol,
such as the hsp70 proteins, including SSA and SSB proteins, for example
proteins encoded by SSA1, SSA2, SSA3, SSA4, SSB1 and SSB2;

• a protein that binds to and stabilises damaged proteins, for example Ssa1p;

• a protein that is involved in the unfolded protein response or provides
for increased resistance to agents (such as tunicamycin and dithiothreitol)
that induce the unfolded protein response, such as a member of the
ORMDL family described by Hjelmqvist et al, 2002, supra (for example,
Orm2p) or proteins involved in the response to stress (e.g. Ubi4p);

• a protein that is a co-chaperone and/or a protein indirectly involved
in protein folding and/or the unfolded protein response (e.g.
Hsp104p, Mdj1p);

• a protein that is involved in the nucleocytoplasmic transport of
macromolecules, such as Pse1p;

• a protein that mediates the transport of macromolecules across the nuclear
membrane by recognising nuclear location sequences and nuclear export
sequences and interacting with the nuclear pore complex, such as PSE1;

• a protein that is able to reactivate ribonuclease activity against RNA of
scrambled ribonuclease as described in EP 0 746 611 and
Hillson et al, 1984, Methods Enzymol., 107, 281-292, such as PDI;

• a protein that has an acidic pI (for example, 4.0-4.5), such as PDI;

• a protein that is a member of the Hsp70 family, and preferably possesses
an N-terminal ATP-binding domain and a C-terminal peptide-binding
domain, such as Ssa1p;

• a protein that is a peptidyl-prolyl cis-trans isomerase (e.g. Cpr3p,
Cpr6p);

• a protein that is a homologue of known chaperones (e.g. Hsp10p);

• a protein that is a mitochondrial chaperone (e.g. Cpr3p);

• a protein that is a cytoplasmic or nuclear chaperone (e.g. Cns1p);

• a protein that is a membrane-bound chaperone (e.g. Orm2p, Fmo1p);

• a protein that has chaperone activator activity or chaperone regulatory
activity (e.g. Aha1p, Hac1p, Hch1p);

• a protein that transiently binds to polypeptides in their immature form
to cause proper folding, transportation and/or secretion, including
proteins required for efficient translocation into the endoplasmic
reticulum (e.g. Lhs1p) or their site of action within the cell (e.g.
Pse1p);

• a protein that is involved in protein complex assembly and/or
ribosome assembly (e.g. Atp11p, Pse1p, Nob1p);

• a protein of the chaperonin T-complex (e.g. Cct2p); or

• a protein of the prefoldin complex (e.g. Pfd1p).



A preferred chaperone is protein disulphide isomerase (PDI) or a fragment or
variant thereof having an equivalent ability to catalyse the formation of disulphide
bonds within the lumen of the endoplasmic reticulum (ER). By "PDI" we include
any protein having the ability to reactivate the ribonuclease activity against RNA
of scrambled ribonuclease as described in EP 0 746 611 and Hillson et al, 1984,
Methods Enzymol., 107, 281-292.

PDI is an enzyme which typically catalyzes thiol:disulphide interchange reactions,
and is a major resident protein component of the ER lumen in secretory cells. A
body of evidence suggests that it plays a role in secretory protein biosynthesis
(Freedman, 1984, Trends Biochem. Sci., 9, 438-41) and this is supported by direct
cross-linking studies in situ (Roth and Pierce, 1987, Biochemistry, 26, 4179-82).

The finding that microsomal membranes deficient in PDI show a specific defect in
cotranslational protein disulphide formation (Bulleid and Freedman, 1988, Nature,
335, 649-51) implies that the enzyme functions as a catalyst of native disulphide bond
formation during the biosynthesis of secretory and cell surface proteins. This role
is consistent with what is known of the enzyme's catalytic properties in vitro; it
catalyzes thiol:disulphide interchange reactions leading to net protein disulphide
formation, breakage or isomerization, and can typically catalyze protein folding
and the formation of native disulphide bonds in a wide variety of reduced,
unfolded protein substrates (Freedman et al, 1989, Biochem. Soc. Symp., 55, 167-
192). PDI also functions as a chaperone since mutant PDI lacking isomerase
activity accelerates protein folding (Hayano et al, 1995, FEBS Letters, 377, 505-
511). Recently, sulphydryl oxidation, not disulphide isomerisation, was reported
to be the principal function of Protein Disulphide Isomerase in S. cerevisiae
(Solovyov et al, 2004, J. Biol. Chem., 279 (33) 34095-34100). The DNA and
amino acid sequence of the enzyme is known for several species (Scherens et al,
1991, Yeast, 7, 185-193; Farquhar et al, 1991, Gene, 108, 81-89; EP074661;
EP0293793; EP0509841) and there is increasing information on the mechanism of
action of the enzyme purified to homogeneity from mammalian liver (Creighton et
al, 1980, J. Mol. Biol., 142, 43-62; Freedman et al, 1988, Biochem. Soc. Trans.,
16, 96-9; Gilbert, 1989, Biochemistry, 28, 7298-7305; Lundstrom and Holmgren,
1990, J. Biol. Chem., 265, 9114-9120; Hawkins and Freedman, 1990, Biochem. J.,
275, 335-339). Of the many protein factors currently implicated as mediators of
protein folding, assembly and translocation in the cell (Rothman, 1989, Cell, 59,
591-601), PDI has a well-defined catalytic activity.

The deletion or inactivation of the endogenous PDI gene in a host results in the
production of an inviable host. In other words, the endogenous PDI gene is an
"essential" gene.

PDI is readily isolated from mammalian tissues and the homogeneous enzyme is a
homodimer (2x57 kD) with characteristically acidic pI (4.0-4.5) (Hillson et al,
1984, op. cit.). The enzyme has also been purified from wheat and from the alga
Chlamydomonas reinhardii (Kaska et al, 1990, Biochem. J., 268, 63-68), rat
(Edman et al, 1985, Nature, 317, 267-270), bovine (Yamauchi et al, 1987,
Biochem. Biophys. Res. Comm., 146, 1485-1492), human (Pihlajaniemi et al,
1987, EMBO J., 6, 643-9), yeast (Scherens et al, supra; Farquhar et al, op. cit.)
and chick (Parkkonen et al, 1988, Biochem. J., 256, 1005-1011). The proteins
from these vertebrate species show a high degree of sequence conservation
throughout and all show several overall features first noted in the rat PDI sequence
(Edman et al, 1985, op. cit.).

Preferred PDI sequences include those from humans and those from yeast species,
such as S. cerevisiae.

A yeast protein disulphide isomerase precursor, PDI1, can be found as Genbank
accession no. CAA42373 or BAA00723. It has the following sequence of 522
amino acids:

  1 mkfsagavls wsslllassv faqqeavape dsavvklatd sfneyiqshd lvlaeffapw
 61 cghcknmape yvkaaetlve knitlaqidc tenqdlcmeh nipgfpslki fknsdvnnsi
121 dyegprtaea ivqfmikqsq pavavvadlp aylanetfvt pvivqsgkid adfnatfysm
181 ankhfndydf vsaenadddf klsiylpsam depvvyngkk adiadadvfe kwlqvealpy
241 fgeidgsvfa qyvesglplg ylfyndeeel eeykplftel akknrglmnf vsidarkfgr
301 hagnlnmkeq fplfaihdmt edlkyglpql seeafdelsd kivleskaie slvkdflkgd
361 aspivksqei fenqdssvfq lvgknhdeiv ndpkkdvlvl yyapwcghck rlaptyqela
421 dtyanatsdv liakldhten dvrgvviegy ptivlypggk ksesvvyqgs rsldslfdfi
481 kenghfdvdg kalyeeaqek aaeeadadae ladeedaihd el



An alternative yeast protein disulphide isomerase sequence can be found as
Genbank accession no. CAA38402. It has the following sequence of 530 amino
acids:

  1 mkfsagavls wsslllassv faqqeavape dsavvklatd sfneyiqshd lvlaeffapw
 61 cghcknmape yvkaaetlve knitlaqidc tenqdlcmeh nipgfpslki fknrdvnnsi
121 dyegprtaea ivqfmikqsq pavavvadlp aylanetfvt pvivqsgkid adfnatfysm
181 ankhfndydf vsaenadddf klsiylpsam depvvyngkk adiadadvfe kwlqvealpy
241 fgeidgsvfa qyvesglplg ylfyndeeel eeykplftel akknrglmnf vsidarkfgr
301 hagnlnmkeq fplfaihdmt edlkyglpql seeafdelsd kivleskaie slvkdflkgd
361 aspivksqei fenqdssvfq lvgknhdeiv ndpkkdvlvl yyapwcghck rlaptyqela
421 dtyanatsdv liakldhten dvrgvviegy ptivlypggk ksesvvyqgs rsldslfdfi
481 kenghfdvdg kalyeeaqek aaeeaeadae aeadadaela deedaihdel

The following alignment of these sequences (the sequence of Genbank accession
no. CAA42373 or BAA00723 first, the sequence of Genbank accession no.
CAA38402 second) shows that the differences between these two sequences are a
single amino acid difference at position 114 (highlighted in bold) and that the
sequence defined by Genbank accession no. CAA38402 contains the additional
amino acids EADAEAEA at positions 506-513.

  1 mkfsagavls wsslllassv faqqeavape dsavvklatd sfneyiqshd lvlaeffapw
  1 mkfsagavls wsslllassv faqqeavape dsavvklatd sfneyiqshd lvlaeffapw

 61 cghcknmape yvkaaetlve knitlaqidc tenqdlcmeh nipgfpslki fknsdvnnsi
 61 cghcknmape yvkaaetlve knitlaqidc tenqdlcmeh nipgfpslki fknrdvnnsi

121 dyegprtaea ivqfmikqsq pavavvadlp aylanetfvt pvivqsgkid adfnatfysm
121 dyegprtaea ivqfmikqsq pavavvadlp aylanetfvt pvivqsgkid adfnatfysm

181 ankhfndydf vsaenadddf klsiylpsam depvvyngkk adiadadvfe kwlqvealpy
181 ankhfndydf vsaenadddf klsiylpsam depvvyngkk adiadadvfe kwlqvealpy

241 fgeidgsvfa qyvesglplg ylfyndeeel eeykplftel akknrglmnf vsidarkfgr
241 fgeidgsvfa qyvesglplg ylfyndeeel eeykplftel akknrglmnf vsidarkfgr

301 hagnlnmkeq fplfaihdmt edlkyglpql seeafdelsd kivleskaie slvkdflkgd
301 hagnlnmkeq fplfaihdmt edlkyglpql seeafdelsd kivleskaie slvkdflkgd

361 aspivksqei fenqdssvfq lvgknhdeiv ndpkkdvlvl yyapwcghck rlaptyqela
361 aspivksqei fenqdssvfq lvgknhdeiv ndpkkdvlvl yyapwcghck rlaptyqela

421 dtyanatsdv liakldhten dvrgvviegy ptivlypggk ksesvvyqgs rsldslfdfi
421 dtyanatsdv liakldhten dvrgvviegy ptivlypggk ksesvvyqgs rsldslfdfi

481 kenghfdvdg kalyeeaqek aaeea***** ***dadaela deedaihdel
481 kenghfdvdg kalyeeaqek aaeeaeadae aeadadaela deedaihdel
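
The two differences stated above can be checked mechanically from the aligned rows. The following is a minimal sketch in Python, assuming the rows are already aligned with `*` marking a gap, as in the alignment; the function name and the tuple layout are illustrative assumptions. Reported positions are alignment columns, which coincide with the numbering of the CAA38402 sequence because that sequence carries no gaps.

```python
def aligned_differences(first, second, start, gap="*"):
    """List the columns at which two equal-length aligned rows differ.

    Returns (position, first_residue, second_residue, kind) tuples, where
    position is the 1-based column number and kind is "gap" or "substitution".
    """
    if len(first) != len(second):
        raise ValueError("aligned rows must have equal length")
    differences = []
    for offset, (a, b) in enumerate(zip(first, second)):
        if a == b:
            continue
        kind = "gap" if gap in (a, b) else "substitution"
        differences.append((start + offset, a, b, kind))
    return differences


def row(text):
    """Strip the spacing between the ten-residue blocks of a printed row."""
    return "".join(text.split())


# Rows 61 and 481 of the alignment (first sequence, then CAA38402).
row_61_first = row("cghcknmape yvkaaetlve knitlaqidc tenqdlcmeh nipgfpslki fknsdvnnsi")
row_61_second = row("cghcknmape yvkaaetlve knitlaqidc tenqdlcmeh nipgfpslki fknrdvnnsi")
row_481_first = row("kenghfdvdg kalyeeaqek aaeea***** ***dadaela deedaihdel")
row_481_second = row("kenghfdvdg kalyeeaqek aaeeaeadae aeadadaela deedaihdel")

substitutions = aligned_differences(row_61_first, row_61_second, 61)
gaps = aligned_differences(row_481_first, row_481_second, 481)

print(substitutions)
print(gaps[0][0], gaps[-1][0], "".join(d[2] for d in gaps).upper())
```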

Variants and fragments of the above PDI sequences, and variants of other naturally
occurring PDI sequences are also included in the present invention. A "variant", in
the context of PDI, refers to a protein wherein at one or more positions there have
been amino acid insertions, deletions, or substitutions, either conservative or non-
conservative, provided that such changes result in a protein whose basic properties,
for example enzymatic activity (type of and specific activity), thermostability,
activity in a certain pH-range (pH-stability) have not significantly been changed.
"Significantly" in this context means that one skilled in the art would say that the
properties of the variant may still be different but would not be unobvious over the
ones of the original protein.

By "conservative substitutions" is intended combinations such as Val, Ile, Leu, Ala,
Met; Asp, Glu; Asn, Gln; Ser, Thr, Gly, Ala; Lys, Arg, His; and Phe, Tyr, Trp.
Preferred conservative substitutions include Gly, Ala; Val, Ile, Leu; Asp, Glu; Asn,
Gln; Ser, Thr; Lys, Arg; and Phe, Tyr.
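
The groupings above lend themselves to a simple lookup. The following is a minimal sketch in Python, assuming standard one-letter amino acid codes (the text uses three-letter codes); the constant and function names are illustrative assumptions.

```python
# Groups listed in the text, written in one-letter code.
BROAD_GROUPS = [
    set("VILAM"),  # Val, Ile, Leu, Ala, Met
    set("DE"),     # Asp, Glu
    set("NQ"),     # Asn, Gln
    set("STGA"),   # Ser, Thr, Gly, Ala
    set("KRH"),    # Lys, Arg, His
    set("FYW"),    # Phe, Tyr, Trp
]
PREFERRED_GROUPS = [
    set("GA"),     # Gly, Ala
    set("VIL"),    # Val, Ile, Leu
    set("DE"),     # Asp, Glu
    set("NQ"),     # Asn, Gln
    set("ST"),     # Ser, Thr
    set("KR"),     # Lys, Arg
    set("FY"),     # Phe, Tyr
]


def is_conservative(original, replacement, groups=BROAD_GROUPS):
    """True if both residues differ and share at least one of the given groups."""
    a, b = original.upper(), replacement.upper()
    if a == b:
        return False  # identical residues are not a substitution
    return any(a in group and b in group for group in groups)


print(is_conservative("K", "R"))                    # Lys/Arg share a group
print(is_conservative("H", "K", PREFERRED_GROUPS))  # His is absent from the preferred groups
print(is_conservative("S", "R"))                    # Ser/Arg share no group
```

Under either set of groupings, the Ser/Arg difference at position 114 of the two yeast PDI sequences shares no group and so would count as non-conservative.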

A "variant" typically has at least 25%, at least 50%, at least 60% or at least 70%,
preferably at least 80%, more preferably at least 90%, even more preferably at least
95%, yet more preferably at least 99%, most preferably at least 99.5% sequence
identity to the polypeptide from which it is derived.

The percent sequence identity between two polypeptides may be determined using
suitable computer programs, as discussed below. Such variants may be natural or
made using the methods of protein engineering and site-directed mutagenesis as are
well known in the art.
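
As a rough illustration of what such a percentage expresses, the following is a minimal sketch in Python. It assumes the two sequences have already been aligned to equal length with `-` marking gaps, and it takes the alignment length as the denominator; dedicated alignment programs may use other conventions, so this is an assumption of the sketch rather than the method referred to in the text.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percentage of alignment columns holding the same residue in both rows."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("aligned sequences must not be empty")
    matches = sum(
        1 for a, b in zip(aligned_a, aligned_b) if a == b and a != gap
    )
    return 100.0 * matches / len(aligned_a)


# Hypothetical four-column alignments, for illustration only.
print(percent_identity("ACDE", "ACDF"))  # one substitution
print(percent_identity("AC-E", "ACDE"))  # one gap column
```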

A "fragment", in the context of PDI, refers to a protein wherein at one or more
positions there have been deletions. Thus the fragment may comprise at most 5, 10,
20, 30, 40 or 50%, typically up to 60%, more typically up to 70%, preferably up to
80%, more preferably up to 90%, even more preferably up to 95%, yet more
preferably up to 99% of the complete sequence of the full mature PDI protein.
Particularly preferred fragments of PDI protein comprise one or more whole
domains of the desired protein.

A fragment or variant of PDI may be a protein that, when expressed
recombinantly in a host cell, can complement the deletion of the endogenously
encoded PDI gene in the host cell, such as S. cerevisiae, and may, for example, be
a naturally occurring homolog of PDI, such as a homolog encoded by another
organism, such as another yeast or other fungi, or another eukaryote such as a
human or other vertebrate, or animal or by a plant.

Another preferred chaperone is SSA1 or a fragment or variant thereof having an
equivalent chaperone-like activity. SSA1, also known as YG100, is located on
chromosome I of the S. cerevisiae genome and is 1.93-kbp in size.

One published protein sequence of SSA1 is as follows:

MSKAVGIDLGTTYSCVAHFANDRVDIIANDQGNRTTPSFVAFTDTERLIGDAAKNQAAMN 
PSNTVFDAKRLIGRNFNDPEVQADMKHFPFKLIDVDGKPQIQVEFKGETKNFTPEQISSM 
VLGKMKETAESYLGAKVNDAVVTVPAYFNDSQRQATKDAGTIAGLNVLRIINEPTAAAIA
YGLDKKGKEEHVLIFDLGGGTFDVSLLFIEDGIFEVKATAGDTHLGGEDFDNRLVNHFIQ 
EFKRKNKKDLSTNQRALRRLRTACERAKRTLSSSAQTSVEIDSLFEGIDFYTSITRARFE 
ELCADLFRSTLDPVEKVLRDAKLDKSQVDEIVLVGGSTRIPKVQKLVTDYFNGKEPNRSI 
NPDEAVAYGAAVQAAILTGDESSKTQDLLLLDVAPLSLGIETAGGVMTKLIPRNSTISTK 
KFEIFSTYADNQPGVLIQVFEGERAKTKDNNLLGKFELSGIPPAPRGVPQIEVTFDVDSN 
GILNVSAVEKGTGKSNKITITNDKGRLSKEDIEKMVAEAEKFKEEDEKESQRIASKNQLE 
SIAYSLKNTISEAGDKLEQADKDTVTKKAEETISWLDSNTTASKEEFDDKLKELQDIANP 
IMSKLYQAGGAPGGAAGGAPGGFPGGAPPAPEAEGPTVEEVD 

A published coding sequence for SSA1 is as follows, although it will be
appreciated that the sequence can be modified by degenerate substitutions to 
obtain alternative nucleotide sequences which encode an identical protein product: 



ATGTCAAAAGCTGTCGGTATTGATTTAGGTACAACATACTCGTGTGTTGCTCACTTTGCT 
AATGATCGTGTGGACATTATTGCCAACGATCAAGGTAACAGAACCACTCCATCTTTTGTC 
GCTTTCACTGACACTGAAAGATTGATTGGTGATGCTGCTAAGAATCAAGCTGCTATGAAT 
CCTTCGAATACCGTTTTCGACGCTAAGCGTTTGATCGGTAGAAACTTCAACGACCCAGAA 
GTGCAGGCTGACATGAAGCACTTCCCATTCAAGTTGATCGATGTTGACGGTAAGCCTCAA
ATTCAAGTTGAATTTAAGGGTGAAACCAAGAACTTTACCCCAGAACAAATCTCCTCCATG 
GTCTTGGGTAAGATGAAGGAAACTGCCGAATCTTACTTGGGAGCCAAGGTCAATGACGCT
GTCGTCACTGTCCCAGCTTACTTCAACGATTCTCAAAGACAAGCTACCAAGGATGCTGGT
ACCATTGCTGGTTTGAATGTCTTGCGTATTATTAACGAACCTACCGCCGCTGCCATTGCT 
TACGGTTTGGACAAGAAGGGTAAGGAAGAACACGTCTTGATTTTCGACTTGGGTGGTGGT 
ACTTTCGATGTCTCTTTGTTGTTCATTGAAGACGGTATCTTTGAAGTTAAGGCCACCGCT
GGTGACACCCATTTGGGTGGTGAAGATTTTGACAACAGATTGGTCAACCACTTCATCCAA
GAATTCAAGAGAAAGAACAAGAAGGACTTGTCTACCAACCAAAGAGCTTTGAGAAGATTA 
AGAACCGCTTGTGAAAGAGCCAAGAGAACTTTGTCTTCCTCCGCTCAAACTTCCGTTGAA 
ATTGACTCTTTGTTCGAAGGTATCGATTTCTACACTTCCATCACCAGAGCCAGATTCGAA
GAATTGTGTGCTGACTTGTTCAGATCTACTTTGGACCCAGTTGAAAAGGTCTTGAGAGAT
GCTAAATTGGACAAATCTCAAGTCGATGAAATTGTCTTGGTCGGTGGTTCTACCAGAATT
CCAAAGGTCCAAAAATTGGTCACTGACTACTTCAACGGTAAGGAACCAAACAGATCTATC 
AACCCAGATGAAGCTGTTGCTTACGGTGCTGCTGTTCAAGCTGCTATTTTGACTGGTGAC
GAATCTTCCAAGACTCAAGATCTATTGTTGTTGGATGTCGCTCCATTATCCTTGGGTATT 
GAAACTGCTGGTGGTGTCATGACCAAGTTGATTCCAAGAAACTCTACCATTTCAACAAAG
AAGTTCGAGATCTTTTCCACTTATGCTGATAACCAACCAGGTGTCTTGATTCAAGTCTTT
GAAGGTGAAAGAGCCAAGACTAAGGACAACAACTTGTTGGGTAAGTTCGAATTGAGTGGT 
ATTCCACCAGCTCCAAGAGGTGTCCCACAAATTGAAGTCACTTTCGATGTCGACTCTAAC 
GGTATTTTGAATGTTTCCGCCGTCGAAAAGGGTACTGGTAAGTCTAACAAGATCACTATT 
ACCAACGACAAGGGTAGATTGTCCAAGGAAGATATCGAAAAGATGGTTGCTGAAGCCGAA
AAATTCAAGGAAGAAGATGAAAAGGAATCTCAAAGAATTGCTTCCAAGAACCAATTGGAA
TCCATTGCTTACTCTTTGAAGAACACCATTTCTGAAGCTGGTGACAAATTGGAACAAGCT 
GACAAGGACACCGTCACCAAGAAGGCTGAAGAGACTATTTCTTGGTTAGACAGCAACACC 
ACTGCCAGCAAGGAAGAATTCGATGACAAGTTGAAGGAGTTGCAAGACATTGCCAACCCA
ATCATGTCTAAGTTGTACCAAGCTGGTGGTGCTCCAGGTGGCGCTGCAGGTGGTGCTCCA
GGCGGTTTCCCAGGTGGTGCTCCTCCAGCTCCAGAGGCTGAAGGTCCAACCGTTGAAGAA
GTTGATTAA
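
The point about degenerate substitutions can be illustrated with the first and last codons of the coding sequence above. The following is a minimal sketch in Python, assuming the standard genetic code; the table construction, the function name and the particular synonymous changes are illustrative assumptions.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an in-frame DNA string; stop codons are shown as '*'."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of three")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First ten codons of the published SSA1 coding sequence.
original = "ATGTCAAAAGCTGTCGGTATTGATTTAGGT"
# Hypothetical degenerate variant: TCA -> TCT (Ser) and TTA -> TTG (Leu).
variant = "ATGTCTAAAGCTGTCGGTATTGATTTGGGT"

print(translate(original))
print(translate(variant))
print(translate("GTTGATTAA"))  # final three codons, ending in the stop codon
```

Both strings translate to the first ten residues of the published protein sequence, although they differ at two nucleotide positions.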

The protein Ssa1p belongs to the Hsp70 family of proteins and is resident in the
cytosol. Hsp70s possess the ability to perform a number of chaperone activities;
aiding protein synthesis, assembly and folding; mediating translocation of
polypeptides to various intracellular locations, and resolution of protein
aggregates (Becker & Craig, 1994, Eur. J. Biochem. 219, 11-23). Hsp70 genes
are highly conserved, possessing an N-terminal ATP-binding domain and a C-
terminal peptide-binding domain. Hsp70 proteins interact with the peptide
backbone of, mainly unfolded, proteins. The binding and release of peptides by
hsp70 proteins is an ATP-dependent process and accompanied by a
conformational change in the hsp70 (Becker & Craig, 1994, supra).

Cytosolic hsp70 proteins are particularly involved in the synthesis, folding and
secretion of proteins (Becker & Craig, 1994, supra). In S. cerevisiae cytosolic
hsp70 proteins have been divided into two groups; SSA (SSA 1-4) and SSB (SSB
1 and 2) proteins, which are functionally distinct from each other. The SSA
family is essential in that at least one protein from the group must be active to
maintain cell viability (Becker & Craig, 1994, supra). Cytosolic hsp70 proteins
bind preferentially to unfolded and not mature proteins. This suggests that they
prevent the aggregation of precursor proteins, by maintaining them in an unfolded
state prior to being assembled into multimolecular complexes in the cytosol and/or
facilitating their translocation to various organelles (Becker & Craig, 1994,
supra). SSA proteins are particularly involved in posttranslational biogenesis and
maintenance of precursors for translocation into the endoplasmic reticulum and
mitochondria (Kim et al, 1998, Proc. Natl. Acad. Sci. USA, 95, 12860-12865;
Ngosuwan et al, 2003, J. Biol. Chem. 278 (9), 7034-7042). Ssa1p has been shown
to bind damaged proteins, stabilising them in a partially unfolded form and
allowing refolding or degradation to occur (Becker & Craig, 1994, supra; Glover
& Lindquist, 1998, Cell, 94, 73-82).

Demolder et al, 1994, J. Biotechnol., 32, 179-189 reported that over-expression of
SSA1 in yeast provided for increases in the expression of a recombinant
chromosomally integrated gene encoding human interferon-β. There is no
suggestion that increases in heterologous gene expression could be achieved if
SSA1 and human interferon-β were to be encoded by recombinant genes on the
same plasmid. In fact, in light of more recent developments in the field of over-
expression of chaperones in yeast (e.g. Robinson et al, 1994, op. cit.; Hayano et
al, 1995, op. cit.; Shusta et al, 1998, op. cit.; Parekh & Wittrup, 1997, op. cit.; Bao
& Fukuhara, 2001, op. cit.; and Bao et al, 2000, op. cit.) the skilled person would
have been disinclined to express SSA1 from a 2µm-family plasmid at all, much
less to express both SSA1 and a heterologous protein from a 2µm-family plasmid
in order to increase the expression levels of a heterologous protein.

Variants and fragments of SSA1 are also included in the present invention. A
"variant", in the context of SSA1, refers to a protein having the sequence of native
SSA1 other than at one or more positions where there have been amino acid
insertions, deletions, or substitutions, either conservative or non-conservative,
provided that such changes result in a protein whose basic properties, for example
enzymatic activity (type of and specific activity), thermostability, activity in a certain
pH-range (pH-stability) have not significantly been changed. "Significantly" in this
context means that one skilled in the art would say that the properties of the variant
may still be different but would not be unobvious over the ones of the original
protein.

By "conservative substitutions" is intended combinations such as Val, Ile, Leu, Ala,
Met; Asp, Glu; Asn, Gln; Ser, Thr, Gly, Ala; Lys, Arg, His; and Phe, Tyr, Trp.
Preferred conservative substitutions include Gly, Ala; Val, Ile, Leu; Asp, Glu; Asn,
Gln; Ser, Thr; Lys, Arg; and Phe, Tyr.
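
As an illustration only, the groupings listed above can be encoded as sets and used to
classify a single amino acid substitution. The following is a minimal sketch; the names
CONSERVATIVE_GROUPS, PREFERRED_GROUPS and is_conservative are hypothetical and do not
form part of the specification.

```python
# Groupings transcribed from the definition of "conservative substitutions"
# above. All identifiers here are illustrative assumptions.
CONSERVATIVE_GROUPS = [
    {"Val", "Ile", "Leu", "Ala", "Met"},
    {"Asp", "Glu"},
    {"Asn", "Gln"},
    {"Ser", "Thr", "Gly", "Ala"},
    {"Lys", "Arg", "His"},
    {"Phe", "Tyr", "Trp"},
]

PREFERRED_GROUPS = [
    {"Gly", "Ala"},
    {"Val", "Ile", "Leu"},
    {"Asp", "Glu"},
    {"Asn", "Gln"},
    {"Ser", "Thr"},
    {"Lys", "Arg"},
    {"Phe", "Tyr"},
]


def is_conservative(original, replacement, groups=CONSERVATIVE_GROUPS):
    """Return True if both residues fall within at least one common group."""
    if original == replacement:
        return False  # identical residues are not a substitution at all
    return any(original in g and replacement in g for g in groups)


print(is_conservative("Val", "Leu"))                    # True
print(is_conservative("Asp", "Lys"))                    # False
print(is_conservative("Lys", "His", PREFERRED_GROUPS))  # False
```

A substitution such as Lys to His is conservative under the broader groupings but is not
among the preferred ones, which is why the group list is passed as a parameter.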

A "variant" of SSA1 typically has at least 25%, at least 50%, at least 60% or at least
70%, preferably at least 80%, more preferably at least 90%, even more preferably at
least 95%, yet more preferably at least 99%, most preferably at least 99.5% sequence
identity to the sequence of native SSA1.

The percent sequence identity between two polypeptides may be determined using
suitable computer programs, as discussed below. Such variants may be natural or
made using the methods of protein engineering and site-directed mutagenesis as are
well known in the art.

A "fragment", in the context of SSA1, refers to a protein having the sequence of
native SSA1 other than at one or more positions where there have been deletions.
Thus the fragment may comprise at most 5, 10, 20, 30, 40 or 50%, typically up to
60%, more typically up to 70%, preferably up to 80%, more preferably up to 90%,
even more preferably up to 95%, yet more preferably up to 99% of the complete
sequence of the full mature SSA1 protein. Particularly preferred fragments of SSA1
protein comprise one or more whole domains of the desired protein.

A fragment or variant of SSA1 may be a protein that, when expressed
recombinantly in a host cell, such as S. cerevisiae, can complement the deletion of
the endogenously encoded SSA1 gene (or homolog thereof) in the host cell and
may, for example, be a naturally occurring homolog of SSA1, such as a homolog
encoded by another organism, such as another yeast or other fungi, or another
eukaryote such as a human or other vertebrate, or animal or by a plant.

Another preferred chaperone is PSE1 or a fragment or variant thereof having
equivalent chaperone-like activity.

PSE1, also known as KAP121, is an essential gene, located on chromosome XIII.
A published protein sequence for the protein Pse1p is as follows:

MSALPEEVNRTLLQIVQAFASPDNQIRSVAEKALSEEWITENNIEYLLTFLAEQAAFSQD 
TTVAALSAVLFRKLALKAPPSSKLMIMSKNITHIRKEVLAQIRSSLLKGFLSERADSIRH 
KLSDAIAECVQDDLPAWPELLQALIESLKSGNPNFRESSFRILTTVPYLITAVDINSILP 
IFQSGFTDASDNVKIAAVTAFVGYFKQLPKSEWSKLGILLPSLLNSLPRFLDDGKDDALA 
SVFESLIELVELAPKLFKDMFDQIIQFTDMVIKNKDLEPPARTTALELLTVFSENAPQMC
KSNQNYGQTLVMVTLIMMTEVSIDDDDAAEWIESDDTDDEEEVTYDHARQALDRVALKLG 
GEYLAAPLFQYLQQMITSTEWRERFAAMMALSSAAEGCADVLIGEIPKILDMVIPLINDP
HPRVQYGCCNVLGQISTDFSPFIQRTAHDRILPALISKLTSECTSRVQTHAAAALVNFSE 
FASKDILEPYLDSLLTNLLVLLQSNKLYVQEQALTTIAFIAEAAKNKFIKYYDTLMPLLL 
NVLKVNNKDNSVLKGKCMECATLIGFAVGKEKFHEHSQELISILVALQNSDIDEDDALRS
YLEQSWSRICRILGDDFVPLLPIVIPPLLITAKATQDVGLIEEEEAANFQQYPDWDVVQV
QGKHIAIHTSVLDDKVSAMELLQSYATLLRGQFAVYVKEVMEEIALPSLDFYLHDGVRAA 
GATLIPILLSCLLAATGTQNEELVLLWHKASSKLIGGLMSEPMPEITQVYHNSLVNGIKV 
MGDNCLSEDQLAAFTKGVSANLTDTYERMQDRHGDGDEYNENIDEEEDFTDEDLLDEINK 
SIAAVLKTTNGHYLKNLENIWPMINTFLLDNEPILVIFALVVIGDLIQYGGEQTASMKNA 
FIPKVTECLISPDARIRQAASYIIGVCAQYAPSTYADVCIPTLDTLVQIVDFPGSKLEEN 
RSSTENASAAIAKILYAYNSNIPNVDTYTANWFKTLPTITDKEAASFNYQFLSQLIENNS 
PIVCAQSNISAVVDSVIQALNERSLTEREGQTVISSVKKLLGFLPSSDAMAIFNRYPADI
MEKVHKWFA* 

A published nucleotide coding sequence of PSE1 is as follows, although it will be
appreciated that the sequence can be modified by degenerate substitutions to
obtain alternative nucleotide sequences which encode an identical protein product:



ATGTCTGCTTTACCGGAAGAAGTTAATAGAACATTACTTCAGATTGTCCAGGCGTTTGCT 

TCCCCTGACAATCAAATACGTTCTGTAGCTGAGAAGGCTCTTAGTGAAGAATGGATTACC 

GAAAACAATATTGAGTATCTTTTAACTTTTTTGGCTGAACAAGCCGCTTTCTCCCAAGAT 

ACAACAGTTGCAGCATTATCTGCTGTTCTGTTTAGAAAATTAGCATTAAAAGCTCCCCCT 

TCTTCGAAGCTTATGATTATGTCCAAAAATATCACACATATTAGGAAAGAAGTTCTTGCA 

CAAATTCGTTCTTCATTGTTAAAAGGGTTTTTGTCGGAAAGAGCTGATTCAATTAGGCAC 

AAACTATCTGATGCTATTGCTGAGTGTGTTCAAGACGACTTACCAGCATGGCCAGAATTA 

CTACAAGCTTTAATAGAGTCTTTAAAAAGCGGTAACCCAAATTTTAGAGAATCCAGTTTT 

AGAATTTTGACGACTGTACCTTATTTAATTACCGCTGTTGACATCAACAGTATCTTACCA 

ATTTTTCAATCAGGCTTTACTGATGCAAGTGATAATGTCAAAATTGCTGCAGTTACGGCT 

TTCGTGGGTTATTTTAAGCAACTACCAAAATCTGAGTGGTCCAAGTTAGGTATTTTATTA 

CCAAGTCTTTTGAATAGTTTACCAAGATTTTTAGATGATGGTAAGGACGATGCCCTTGCA 

TCAGTTTTTGAATCGTTAATTGAGTTGGTGGAATTGGCACCAAAACTATTCAAGGATATG 

TTTGACCAAATAATACAATTCACTGATATGGTTATAAAAAATAAGGATTTAGAACCTCCA 

GCAAGAACCACAGCACTCGAACTGCTAACCGTTTTCAGCGAGAACGCTCCCCAAATGTGT 

AAATCGAACCAGAATTACGGGCAAACTTTAGTGATGGTTACTTTAATCATGATGACGGAG 

GTATCCATAGATGATGATGATGCAGCAGAATGGATAGAATCTGACGATACCGATGATGAA 

GAGGAAGTTACATATGACCACGCTCGTCAAGCTCTTGATCGTGTTGCTTTAAAGCTGGGT 

GGTGAATATTTGGCTGCACCATTGTTCCAATATTTACAGCAAATGATCACATCAACCGAA 

TGGAGAGAAAGATTCGCGGCCATGATGGCACTTTCCTCTGCAGCTGAGGGTTGTGCTGAT 

GTTCTGATCGGCGAGATCCCAAAAATCCTGGATATGGTAATTCCCCTCATCAACGATCCT 

CATCCAAGAGTACAGTATGGATGTTGTAATGTTTTGGGTCAAATATCTACTGATTTTTCA

CCATTCATTCAAAGAACTGCACACGATAGAATTTTGCCGGCTTTAATATCTAAACTAACG

TCAGAATGCACCTCAAGAGTTCAAACGCACGCCGCAGCGGCTCTGGTTAACTTTTCTGAA 

TTCGCTTCGAAGGATATTCTTGAGCCTTACTTGGATAGTCTATTGACAAATTTATTAGTT 

TTATTACAAAGCAACAAACTTTACGTACAGGAACAGGCCCTAACAACCATTGCATTTATT

GCTGAAGCTGCAAAGAATAAATTTATCAAGTATTACGATACTCTAATGCCATTATTATTA 

AATGTTTTGAAGGTTAACAATAAAGATAATAGTGTTTTGAAAGGTAAATGTATGGAATGT 

GCAACTCTGATTGGTTTTGCCGTTGGTAAGGAAAAATTTCATGAGCACTCTCAAGAGCTG 

ATTTCTATATTGGTCGCTTTACAAAACTCAGATATCGATGAAGATGATGCGCTCAGATCA 

TACTTAGAACAAAGTTGGAGCAGGATTTGCCGAATTCTGGGTGATGATTTTGTTCCGTTG 

TTACCGATTGTTATACCACCCCTGCTAATTACTGCCAAAGCAACGCAAGACGTCGGTTTA 

ATTGAAGAAGAAGAAGCAGCAAATTTCCAACAATATCCAGATTGGGATGTTGTTCAAGTT 

CAGGGAAAACACATTGCTATTCACACATCCGTCCTTGACGATAAAGTATCAGCAATGGAG 

CTATTACAAAGCTATGCGACACTTTTAAGAGGCCAATTTGCTGTATATGTTAAAGAAGTA 

ATGGAAGAAATAGCTCTACCATCGCTTGACTTTTACCTACATGACGGTGTTCGTGCTGCA 

GGAGCAACTTTAATTCCTATTCTATTATCTTGTTTACTTGCAGCCACCGGTACTCAAAAC 

GAGGAATTGGTATTGTTGTGGCATAAAGCTTCGTCTAAACTAATCGGAGGCTTAATGTCA 

GAACCAATGCCAGAAATCACGCAAGTTTATCACAACTCGTTAGTGAATGGTATTAAAGTC 

ATGGGTGACAATTGCTTAAGCGAAGACCAATTAGCGGCATTTACTAAGGGTGTCTCCGCC 

AACTTAACTGACACTTACGAAAGGATGCAGGATCGCCATGGTGATGGTGATGAATATAAT 

GAAAATATTGATGAAGAGGAAGACTTTACTGACGAAGATCTTCTCGATGAAATCAACAAG 

TCTATCGCGGCCGTTTTGAAAACCACAAATGGTCATTATCTAAAGAATTTGGAGAATATA 

TGGCCTATGATAAACACATTCCTTTTAGATAATGAACCAATTTTAGTCATTTTTGCATTA 

GTAGTGATTGGTGACTTGATTCAATATGGTGGCGAACAAACTGCTAGCATGAAGAACGCA 

TTTATTCCAAAGGTTACCGAGTGCTTGATTTCTCCTGACGCTCGTATTCGCCTy^GCTGCT 

TCTTATATAATCGGTGTTTGTGCCCAATACGCTCCATCTACATATGCTGACGTTTGCATA 

CCGACTTTAGATACACTTGTTCAGATTGTCGATTTTCCAGGCTCCAAACTGGAAGAAAAT 

CGTTCTTCAACAGAGAATGCCAGTGCAGCCATCGCCAAAATTCTTTATGCATACAATTCC 

AACATTCCTAACGTAGACACGTACACGGCTAATTGGTTCAAAACGTTACCAACAATAACT 

GACAAAGAAGCTGCCTCATTCAACTATCAATTTTTGAGTCAATTGATTGAAAATAATTCG

CCAATTGTGTGTGCTCAATCTAATATCTCCGCTGTAGTTGATTCAGTCATACAAGCCTTG 

AATGAGAGAAGTTTGACCGAAAGGGAAGGCCAAACGGTGATAAGTTCAGTTAAAAAGTTG

TTGGGATTTTTGCCTTCTAGTGATGCTATGGCAATTTTCAATAGATATCCAGCTGATATT 

ATGGAGAAAGTACATAAATGGTTTGCATAA 

The PSE1 gene is 3.25 kbp in size. Pse1p is involved in the nucleocytoplasmic
transport of macromolecules (Seedorf & Silver, 1997, Proc. Natl. Acad. Sci. USA,
94, 8590-8595). This process occurs via the nuclear pore complex (NPC)
embedded in the nuclear envelope and made up of nucleoporins (Ryan & Wente,
2000, Curr. Opin. Cell Biol. 12, 361-371). Proteins possess specific sequences
that contain the information required for nuclear import, the nuclear localisation
sequence (NLS), and for export, the nuclear export sequence (NES) (Pemberton
et al., 1998, Curr. Opin. Cell Biol. 10, 392-399). Pse1p is a karyopherin/importin,
a group of proteins which have been divided into α and β families.
Karyopherins are soluble transport factors that mediate the transport of
macromolecules across the nuclear membrane by recognising NLS and NES, and
interact with the NPC (Seedorf & Silver, 1997, supra; Pemberton et al., 1998,
supra; Ryan & Wente, 2000, supra). Translocation through the nuclear pore is
driven by GTP hydrolysis, catalysed by the small GTP-binding protein Ran
(Seedorf & Silver, 1997, supra). Pse1p has been identified as a karyopherin β. 14
karyopherin β proteins have been identified in S. cerevisiae, of which only 4 are
essential. This is perhaps because multiple karyopherins may mediate the
transport of a single macromolecule (Isoyama et al., 2001, J. Biol. Chem. 276
(24), 21863-21869). Pse1p is localised to the nucleus, at the nuclear envelope,
and to a certain extent to the cytoplasm. This suggests the protein moves in and
out of the nucleus as part of its transport function (Seedorf & Silver, 1997, supra).
Pse1p is involved in the nuclear import of transcription factors (Isoyama et al.,
2001, supra; Ueta et al., 2003, J. Biol. Chem. 278 (50), 50120-50127), histones
(Mosammaparast et al., 2002, J. Biol. Chem. 277 (1), 862-868), and ribosomal
proteins prior to their assembly into ribosomes (Pemberton et al., 1998, supra). It
also mediates the export of mRNA from the nucleus. Karyopherins recognise and
bind distinct NES found on RNA-binding proteins, which coat the RNA before it
is exported from the nucleus (Seedorf & Silver, 1997, supra; Pemberton et al.,
1998, supra).

As nucleocytoplasmic transport of macromolecules is essential for proper
progression through the cell cycle, nuclear transport factors, such as Pse1p, are
novel candidate targets for growth control (Seedorf & Silver, 1997, supra).

Overexpression of Pse1p (protein secretion enhancer) in S. cerevisiae has also
been shown to increase endogenous protein secretion levels of a repertoire of
biologically active proteins (Chow et al., 1992, J. Cell. Sci. 101 (3), 709-719).
There is no suggestion that increases in heterologous gene expression could be
achieved if PSE1 and a heterologous protein were both to be encoded by
recombinant genes on the same plasmid. In fact, in light of more recent
developments in the over-expression of chaperones in yeast (e.g. Robinson et al.,
1994, op. cit.; Hayano et al., 1995, op. cit.; Shusta et al., 1998, op. cit.; Parekh &
Wittrup, 1997, op. cit.; Bao & Fukuhara, 2001, op. cit.; and Bao et al., 2000, op.
cit.) the skilled person would not have attempted to over-express PSE1 from a
2µm-family plasmid at all, much less to express both PSE1 and a heterologous
protein from a 2µm-family plasmid in order to increase the expression levels of a
heterologous protein.

Variants and fragments of PSE1 are also included in the present invention. A
"variant", in the context of PSE1, refers to a protein having the sequence of native
PSE1 other than at one or more positions where there have been amino acid
insertions, deletions, or substitutions, either conservative or non-conservative,
provided that such changes result in a protein whose basic properties, for example
enzymatic activity (type of and specific activity), thermostability, activity in a certain
pH-range (pH-stability) have not significantly been changed. "Significantly" in this
context means that one skilled in the art would say that the properties of the variant
may still be different but would not be unobvious over the ones of the original
protein.

By "conservative substitutions" is intended combinations such as Val, Ile, Leu, Ala,
Met; Asp, Glu; Asn, Gln; Ser, Thr, Gly, Ala; Lys, Arg, His; and Phe, Tyr, Trp.
Preferred conservative substitutions include Gly, Ala; Val, Ile, Leu; Asp, Glu; Asn,
Gln; Ser, Thr; Lys, Arg; and Phe, Tyr.

A "variant" of PSE1 typically has at least 25%, at least 50%, at least 60% or at least
70%, preferably at least 80%, more preferably at least 90%, even more preferably at
least 95%, yet more preferably at least 99%, most preferably at least 99.5% sequence
identity to the sequence of native PSE1.

The percent sequence identity between two polypeptides may be determined using 
suitable computer programs, as discussed below. Such variants may be natural or 
made using the methods of protein engineering and site-directed mutagenesis as are 
well known in the art. 

A "fragment", in the context of PSE1, refers to a protein having the sequence of
native PSE1 other than at one or more positions where there have been deletions.
Thus the fragment may comprise at most 5, 10, 20, 30, 40 or 50%, typically up to
60%, more typically up to 70%, preferably up to 80%, more preferably up to 90%,
even more preferably up to 95%, yet more preferably up to 99% of the complete
sequence of the full mature PSE1 protein. Particularly preferred fragments of PSE1
protein comprise one or more whole domains of the desired protein.

A fragment or variant of PSE1 may be a protein that, when expressed
recombinantly in a host cell, such as S. cerevisiae, can complement the deletion of
the endogenous PSE1 gene in the host cell, and may, for example, be a naturally
occurring homolog of PSE1, such as a homolog encoded by another organism,
such as another yeast or other fungi, or another eukaryote such as a human or
other vertebrate, or animal or by a plant.

Another preferred chaperone is ORM2 or a fragment or variant thereof having
equivalent chaperone-like activity.

ORM2, also known as YLR350W, is located on chromosome XII (positions
828729 to 829379) of the S. cerevisiae genome and encodes an evolutionarily
conserved protein with similarity to the yeast protein Orm1p. Hjelmqvist et al.,
2002, Genome Biology, 3(6), research 0027.1-0027.16 reports that ORM2 belongs
to a gene family comprising three human genes (ORMDL1, ORMDL2 and
ORMDL3) as well as homologs in microsporidia, plants, Drosophila,
urochordates and vertebrates. The ORMDL genes are reported to encode
transmembrane proteins anchored in the endoplasmic reticulum (ER).

The protein Orm2p is required for resistance to agents that induce the unfolded
protein response. Hjelmqvist et al., 2002 (supra) reported that a double knockout
of the two S. cerevisiae ORMDL homologs (ORM1 and ORM2) leads to a
decreased growth rate and greater sensitivity to tunicamycin and dithiothreitol.

One published sequence of Orm2p is as follows: 

MIDRTKNESPAFEESPLTPNVSNLKPFPSQSNKISTPVTDHRRRRSSSVISHVEQETFED
ENDQQMLPNMNATWVDQRGAWLIHIVVIVLLRLFYSLFGSTPKWTWTLTNMTYIIGFYIM
FHLVKGTPFDFNGGAYDNLTMWEQINDETLYTPTRKFLLIVPIVLFLISNQYYRNDMTLF
LSNLAVTVLIGVVPKLGITHRLRISIPGITGRAQIS*

The above protein is encoded in S. cerevisiae by the following coding nucleotide
sequence, although it will be appreciated that the sequence can be modified by
degenerate substitutions to obtain alternative nucleotide sequences which encode
an identical protein product:

ATGATTGACCGCACTAAAAACGAATCTCCAGCTTTTGAAGAGTCTCCGCTTACCCCCAAT
GTGTCTAACCTGAAACCATTCCCTTCTCAAAGCAACAAAATATCCACTCCAGTGACCGAC 
CATAGGAGAAGACGGTCATCCAGCGTAATATCACATGTGGAACAGGAAACCTTCGAAGAC
GAAAATGACCAGCAGATGCTTCCCAACATGAACGCTACGTGGGTCGACCAGCGAGGCGCG
TGGTTGATTCATATCGTCGTAATAGTACTCTTGAGGCTCTTCTACTCCTTGTTCGGGTCG 
ACGCCCAAATGGACGTGGACTTTAACAAACATGACCTACATCATCGGATTCTATATCATG 
TTCCACCTTGTCAAAGGTACGCCCTTCGACTTTAACGGTGGTGCGTACGACAACCTGACC 
ATGTGGGAGCAGATTAACGATGAGACTTTGTACACACCCACTAGAAAATTTCTGCTGATT
GTACCCATTGTGTTGTTCCTGATTAGCAACCAGTACTACCGGAACGACATGACACTATTC 
CTCTCCAACCTCGCCGTGACGGTGCTTATTGGTGTCGTTCCTAAGCTGGGAATTACGCAT 
AGACTAAGAATATCCATCCCTGGTATTACGGGCCGTGCTCAAATTAGTTAG 
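
The statement that degenerate substitutions yield alternative nucleotide sequences
encoding an identical protein can be checked with a short translation routine. The
following is a sketch only, assuming the standard genetic code; the function name
translate and the recoded sequence are hypothetical illustrations. The native sequence
is the first eight codons of the ORM2 coding sequence given above.

```python
# Standard genetic code, built from the conventional T/C/A/G ordering.
# "*" denotes a stop codon. All identifiers are illustrative assumptions.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate an in-frame coding sequence into one-letter amino acids."""
    dna = dna.upper().replace("U", "T")
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of three")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


native = "ATGATTGACCGCACTAAAAACGAA"   # first eight codons of ORM2, as listed above
recoded = "ATGATCGATAGAACCAAGAATGAG"  # hypothetical synonymous recoding

print(translate(native))   # MIDRTKNE
print(translate(recoded))  # MIDRTKNE
```

Both sequences translate to the same eight residues, although seven of the eight
codons differ.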

Variants and fragments of ORM2 are also included in the present invention. A
"variant", in the context of ORM2, refers to a protein having the sequence of native
ORM2 other than at one or more positions where there have been amino acid
insertions, deletions, or substitutions, either conservative or non-conservative,
provided that such changes result in a protein whose basic properties, for example
enzymatic activity (type of and specific activity), thermostability, activity in a certain
pH-range (pH-stability) have not significantly been changed. "Significantly" in this
context means that one skilled in the art would say that the properties of the variant
may still be different but would not be unobvious over the ones of the original
protein.

By "conservative substitutions" is intended combinations such as Val, Ile, Leu, Ala,
Met; Asp, Glu; Asn, Gln; Ser, Thr, Gly, Ala; Lys, Arg, His; and Phe, Tyr, Trp.
Preferred conservative substitutions include Gly, Ala; Val, Ile, Leu; Asp, Glu; Asn,
Gln; Ser, Thr; Lys, Arg; and Phe, Tyr.

A "variant" of ORM2 typically has at least 25%, at least 50%, at least 60% or at least
70%, preferably at least 80%, more preferably at least 90%, even more preferably at
least 95%, yet more preferably at least 99%, most preferably at least 99.5% sequence
identity to the sequence of native ORM2.

The percent sequence identity between two polypeptides may be determined using
suitable computer programs, as discussed below. Such variants may be natural or
made using the methods of protein engineering and site-directed mutagenesis as are
well known in the art.

A "fragment", in the context of ORM2, refers to a protein having the sequence of
native ORM2 other than at one or more positions where there have been
deletions. Thus the fragment may comprise at most 5, 10, 20, 30, 40 or 50%,
typically up to 60%, more typically up to 70%, preferably up to 80%, more
preferably up to 90%, even more preferably up to 95%, yet more preferably up to
99% of the complete sequence of the full mature ORM2 protein. Particularly
preferred fragments of ORM2 protein comprise one or more whole domains of the
desired protein.

A fragment or variant of ORM2 may be a protein that, when expressed
recombinantly in a host cell, such as S. cerevisiae, can complement the deletion of
the endogenous ORM2 gene in the host cell and may, for example, be a naturally
occurring homolog of ORM2, such as a homolog encoded by another organism,
such as another yeast or other fungi, or another eukaryote such as a human or
other vertebrate, or animal or by a plant.

A gene encoding a protein comprising the sequence of a chaperone may be formed
in a like manner to that discussed below for genes encoding heterologous proteins,
with particular emphasis on combinations of ORFs and regulatory regions.

The term "protein" as used herein includes all natural and non-natural proteins,
polypeptides and peptides. A "heterologous protein" is a protein that is not
naturally encoded by a 2µm-family plasmid and can also be described as a "non
2µm-family plasmid protein". For convenience, the terms "heterologous protein"
and "non 2µm-family plasmid protein" are used synonymously throughout this
application. Preferably, therefore, the heterologous protein is not a FLP, REP1,
REP2, or a RAF/D protein as encoded by any one of pSR1, pSB3 or pSB4 as
obtained from Z. rouxii, pSB1 or pSB2 both as obtained from Z. bailii, pSM1 as
obtained from Z. fermentati, pKD1 as obtained from K. drosophilarum, pPM1 as
obtained from P. membranaefaciens or the 2µm plasmid as obtained from S.
cerevisiae.

A gene encoding a heterologous protein comprises polynucleotide sequence
encoding the heterologous protein (typically according to standard codon usage
for any given organism), designated the open reading frame ("ORF"). The gene
may additionally comprise some polynucleotide sequence that does not encode an
open reading frame (termed "non-coding region").

Non-coding region in the gene may contain one or more regulatory sequences,
operatively linked to the ORF, which allow for the transcription of the open
reading frame and/or translation of the resultant transcript.

The term "regulatory sequence" refers to a sequence that modulates (i.e., promotes
or reduces) the expression (i.e., the transcription and/or translation) of an ORF to
which it is operably linked. Regulatory regions typically include promoters,
terminators, ribosome binding sites and the like. The skilled person will
appreciate that the choice of regulatory region will depend upon the intended
expression system. For example, promoters may be constitutive or inducible and
may be cell- or tissue-type specific or non-specific.

Suitable regulatory regions may be 5bp, 10bp, 15bp, 20bp, 25bp, 30bp, 35bp,
40bp, 45bp, 50bp, 60bp, 70bp, 80bp, 90bp, 100bp, 120bp, 140bp, 160bp, 180bp,
200bp, 220bp, 240bp, 260bp, 280bp, 300bp, 350bp, 400bp, 450bp, 500bp, 550bp,
600bp, 650bp, 700bp, 750bp, 800bp, 850bp, 900bp, 950bp, 1000bp, 1100bp,
1200bp, 1300bp, 1400bp, 1500bp or greater, in length.

Those skilled in the art will recognise that the gene encoding the chaperone, for
example PDI, may additionally comprise non-coding regions and/or regulatory
regions. Such non-coding regions and regulatory regions are not restricted to the
native non-coding regions and/or regulatory regions normally associated with the
chaperone ORF.

Where the expression system is yeast, such as Saccharomyces cerevisiae, suitable
promoters for S. cerevisiae include those associated with the PGK1 gene, GAL1 or
GAL10 genes, TEF1, TEF2, PYK1, PMA1, CYC1, PHO5, TRP1, ADH1, ADH2,
the genes for glyceraldehyde-3-phosphate dehydrogenase, hexokinase, pyruvate
decarboxylase, phosphofructokinase, triose phosphate isomerase, phosphoglucose
isomerase, glucokinase, α-mating factor pheromone, a-mating factor pheromone,
the PRB1 promoter, the PRA1 promoter, the GPD1 promoter, and hybrid
promoters involving hybrids of parts of 5' regulatory regions with parts of 5'
regulatory regions of other promoters or with upstream activation sites (e.g. the
promoter of EP-A-258 067).

Suitable transcription termination signals are well known in the art. Where the
host cell is eukaryotic, the transcription termination signal is preferably derived
from the 3' flanking sequence of a eukaryotic gene, which contains proper signals
for transcription termination and polyadenylation. Suitable 3' flanking sequences
may, for example, be those of the gene naturally linked to the expression control
sequence used, i.e. may correspond to the promoter. Alternatively, they may be
different. In that case, and where the host is a yeast, preferably S. cerevisiae, then
the termination signal of the S. cerevisiae ADH1, ADH2, CYC1, or PGK1 genes
is preferred.

It may be beneficial for the promoter and open reading frame of the heterologous
gene, such as those of the chaperone PDI1, to be flanked by transcription
termination sequences so that the transcription termination sequences are located
both upstream and downstream of the promoter and open reading frame, in order
to prevent transcriptional read-through into neighbouring genes, such as 2µm
genes, and vice versa.

In one embodiment, the favoured regulatory sequences in yeast, such as
Saccharomyces cerevisiae, include: a yeast promoter (e.g. the Saccharomyces
cerevisiae PRB1 promoter), as taught in EP 431 880; and a transcription
terminator, preferably the terminator from Saccharomyces ADH1, as taught in EP
60 057. Preferably, the vector incorporates at least two translation stop codons.

It may be beneficial for the non-coding region to incorporate more than one DNA
sequence encoding a translational stop codon, such as UAA, UAG or UGA, in
order to minimise translational read-through and thus avoid the production of
elongated, non-natural fusion proteins. The translation stop codon UAA is
preferred.
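
By way of illustration, the number of consecutive in-frame stop codons at the 3' end of
a construct can be counted as follows. This is a sketch under the assumption that the
additional stop codons are placed in tandem and in frame immediately after the ORF; the
function name trailing_stop_codons and the example sequences are hypothetical.

```python
# DNA equivalents of the stop codons UAA, UAG and UGA named above.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def trailing_stop_codons(coding_dna):
    """Count consecutive in-frame stop codons at the 3' end of a sequence."""
    seq = coding_dna.upper().replace("U", "T")
    if len(seq) % 3 != 0:
        raise ValueError("sequence length must be a multiple of three")
    count = 0
    # Walk backwards one codon at a time until a sense codon is met.
    for i in range(len(seq) - 3, -1, -3):
        if seq[i:i + 3] in STOP_CODONS:
            count += 1
        else:
            break
    return count


print(trailing_stop_codons("ATGGCATAA"))     # 1
print(trailing_stop_codons("ATGGCATAATAA"))  # 2
```

A construct returning a count of two or more would satisfy the stated preference for
at least two translation stop codons, under the tandem-placement assumption above.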

The term "operably linked" includes within its meaning that a regulatory sequence
is positioned within any non-coding region in a gene such that it forms a
relationship with an ORF that permits the regulatory region to exert an effect on
the ORF in its intended manner. Thus a regulatory region "operably linked" to an
ORF is positioned in such a way that the regulatory region is able to influence
transcription and/or translation of the ORF in the intended manner, under
conditions compatible with the regulatory sequence.

In one preferred embodiment, the heterologous protein is secreted. In that case, a
sequence encoding a secretion leader sequence which, for example, comprises
most of the natural HSA secretion leader, plus a small portion of the S. cerevisiae
α-mating factor secretion leader as taught in WO 90/01063 may be included in the
open reading frame.

Alternatively, the heterologous protein may be intracellular.

In another preferred embodiment, the heterologous protein comprises the
sequence of a eukaryotic protein, or a fragment or variant thereof. Suitable
eukaryotes include fungi, plants and animals. In one preferred embodiment the
heterologous protein is a fungal protein, such as a yeast protein. In another
preferred embodiment the heterologous protein is an animal protein. Exemplary
animals include vertebrates and invertebrates. Exemplary vertebrates include
mammals, such as humans, and non-human mammals.

Thus the heterologous protein may comprise the sequence of a yeast protein. It
may, for example, comprise the sequence of a yeast protein from the same host
from which the 2µm-family plasmid is derived. Those skilled in the art will
recognise that a method, use or plasmid of the first, second or third aspects of the
invention may comprise DNA sequences encoding more than one heterologous
protein, more than one chaperone, or more than one heterologous protein and
more than one chaperone.

In another preferred embodiment, the heterologous protein may comprise the
sequence of albumin, a monoclonal antibody, an etoposide, a serum protein (such
as a blood clotting factor), antistasin, a tick anticoagulant peptide, transferrin,
lactoferrin, endostatin, angiostatin, collagens, immunoglobulins or
immunoglobulin-based molecules or fragment of either (e.g. a Small Modular
ImmunoPharmaceutical™ ("SMIP") or dAb, Fab' fragments, F(ab')2, scAb, scFv
or scFv fragment), a Kunitz domain protein (such as those described in WO
03/066824, with or without albumin fusions), interferons, interleukins, IL10, IL11,
IL2, interferon α species and sub-species, interferon β species and sub-species,
interferon γ species and sub-species, leptin, CNTF, CNTFAx15, IL1-receptor
antagonist, erythropoietin (EPO) and EPO mimics, thrombopoietin (TPO) and
TPO mimics, prosaptide, cyanovirin-N, 5-helix, T20 peptide, T1249 peptide, HIV
gp41, HIV gp120, urokinase, prourokinase, tPA, hirudin, platelet derived growth
factor, parathyroid hormone, proinsulin, insulin, glucagon, glucagon-like peptides,
insulin-like growth factor, calcitonin, growth hormone, transforming growth factor
β, tumour necrosis factor, G-CSF, GM-CSF, M-CSF, FGF, coagulation factors in
both pre and active forms, including but not limited to plasminogen, fibrinogen,
thrombin, pre-thrombin, pro-thrombin, von Willebrand's factor, α1-antitrypsin,
plasminogen activators, Factor VII, Factor VIII, Factor IX, Factor X and Factor
XIII, nerve growth factor, LACI, platelet-derived endothelial cell growth factor
(PD-ECGF), glucose oxidase, serum cholinesterase, aprotinin, amyloid precursor
protein, inter-alpha trypsin inhibitor, antithrombin III, apo-lipoprotein species,
Protein C, Protein S, or a variant or fragment of any of the above.

A "variant", in the context of the above-listed proteins, refers to a protein wherein at
one or more positions there have been amino acid insertions, deletions, or
substitutions, either conservative or non-conservative, provided that such changes
result in a protein whose basic properties, for example enzymatic activity or receptor
binding (type of and specific activity), thermostability, activity in a certain pH-range
(pH-stability) have not significantly been changed. "Significantly" in this context
means that one skilled in the art would say that the properties of the variant may still
be different but would not be unobvious over the ones of the original protein.

By "conservative substitutions" is intended combinations such as Val, Ile, Leu, Ala,
Met; Asp, Glu; Asn, Gln; Ser, Thr, Gly, Ala; Lys, Arg, His; and Phe, Tyr, Trp.
Preferred conservative substitutions include Gly, Ala; Val, Ile, Leu; Asp, Glu; Asn,
Gln; Ser, Thr; Lys, Arg; and Phe, Tyr.

A "variant" typically has at least 25%, at least 50%, at least 60% or at least 70%,
preferably at least 80%, more preferably at least 90%, even more preferably at least
95%, yet more preferably at least 99%, most preferably at least 99.5% sequence
identity to the polypeptide from which it is derived.

The percent sequence identity between two polypeptides may be determined using
suitable computer programs, for example the GAP program of the University of
Wisconsin Genetic Computing Group and it will be appreciated that percent
identity is calculated in relation to polypeptides whose sequence has been aligned
optimally.

The alignment may alternatively be carried out using the Clustal W program
(Thompson et al., (1994) Nucleic Acids Res., 22(22), 4673-80). The parameters
used may be as follows:

• Fast pairwise alignment parameters: K-tuple(word) size; 1, window size; 5,
gap penalty; 3, number of top diagonals; 5. Scoring method: x percent.

• Multiple alignment parameters: gap open penalty; 10, gap extension penalty;
0.05.

• Scoring matrix: BLOSUM.
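
Once two polypeptides have been aligned, for example with one of the programs named
above, percent identity can be computed from the aligned strings. The following is a
minimal sketch and not the calculation performed by GAP or Clustal W: it assumes that
gaps are written as "-" and that identity is expressed as identical positions divided
by the number of alignment columns, which is only one of several conventions in use.
The function name percent_identity is hypothetical.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity over alignment columns (assumed convention)."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Ignore columns in which both sequences carry a gap.
    columns = [
        (a, b) for a, b in zip(aligned_a, aligned_b)
        if not (a == gap and b == gap)
    ]
    if not columns:
        raise ValueError("no aligned positions to compare")
    matches = sum(1 for a, b in columns if a == b and a != gap)
    return 100.0 * matches / len(columns)


# One substitution in eight aligned residues.
print(percent_identity("MIDRTKNE", "MIDKTKNE"))  # 87.5
```

Under this convention a single substitution in eight residues gives 87.5% identity,
which would meet the "at least 80%" tier but not the "at least 90%" tier.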

Such variants may be natural or made using the methods of protein engineering and 
site-directed mutagenesis as are well known in the art. 

A "fragment", in the context of the above-listed proteins, refers to a protein wherein
at one or more positions there have been deletions. Thus the fragment may comprise
at most 5, 10, 20, 30, 40 or 50% of the complete sequence of the full mature
polypeptide. Typically a fragment comprises up to 60%, more typically up to 70%,
preferably up to 80%, more preferably up to 90%, even more preferably up to 95%,
yet more preferably up to 99% of the complete sequence of the full desired protein.
Particularly preferred fragments of a protein comprise one or more whole domains of
the protein.
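
The percentages above relate the length of a fragment to the length of the full mature
polypeptide. A minimal sketch of that arithmetic follows, assuming that the measure is
simply the ratio of residue counts; the function name fragment_percent and the short
example sequences are hypothetical.

```python
def fragment_percent(fragment, full_length):
    """Length of a fragment as a percentage of the full-length sequence."""
    if not full_length:
        raise ValueError("full-length sequence must not be empty")
    if len(fragment) > len(full_length):
        raise ValueError("fragment is longer than the full-length sequence")
    return 100.0 * len(fragment) / len(full_length)


full = "MIDRTKNESP"  # illustrative ten-residue sequence
frag = "MIDRT"       # illustrative five-residue fragment

print(fragment_percent(frag, full))  # 50.0
```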

In one particularly preferred embodiment the heterologous protein comprises the
sequence of albumin or a variant or fragment thereof.

By "albumin" we include a protein comprising the sequence of an albumin protein
obtained from any source. Typically the source is mammalian. In one preferred
embodiment the serum albumin is human serum albumin ("HSA"). The term
"human serum albumin" includes the meaning of a serum albumin having an
amino acid sequence naturally occurring in humans, and variants thereof.
Preferably the albumin has the amino acid sequence disclosed in WO 90/13653 or
a variant thereof. The HSA coding sequence is obtainable by known methods for
isolating cDNA corresponding to human genes, and is also disclosed in, for
example, EP 73 646 and EP 286 424.

In another preferred embodiment the "albumin" comprises the sequence of bovine 
serum albumin. The term "bovine serum albumin" includes the meaning of a
serum albumin having an amino acid sequence naturally occurring in cows, for
example as taken from Swissprot accession number P02769, and variants thereof
as defined below. The term "bovine serum albumin" also includes the meaning of
fragments of full-length bovine serum albumin or variants thereof, as defined 
below. 

In another preferred embodiment the albumin comprises the sequence of an
albumin derived from one of serum albumin from dog (e.g. see Swissprot
accession number P49822), pig (e.g. see Swissprot accession number P08835),
goat (e.g. as available from Sigma as product no. A2514 or A4164), turkey (e.g.
see Swissprot accession number O73860), baboon (e.g. as available from Sigma
as product no. A1516), cat (e.g. see Swissprot accession number P49064), chicken
(e.g. see Swissprot accession number P19121), ovalbumin (e.g. chicken
ovalbumin) (e.g. see Swissprot accession number P01012), donkey (e.g. see
Swissprot accession number P39090), guinea pig (e.g. as available from Sigma as
product no. A3060, A2639, 05483 or A6539), hamster (e.g. as available from
Sigma as product no. A5409), horse (e.g. see Swissprot accession number
P35747), rhesus monkey (e.g. see Swissprot accession number Q28522), mouse
(e.g. see Swissprot accession number O89020), pigeon (e.g. as defined by Khan et
al, 2002, Int. J. Biol. Macromol., 30(3-4), 171-8), rabbit (e.g. see Swissprot
accession number P49065), rat (e.g. see Swissprot accession number P36953) and
sheep (e.g. see Swissprot accession number P14639) and includes variants and
fragments thereof as defined below.

Many naturally occurring mutant forms of albumin are known. Many are 
described in Peters, (1996, All About Albumin: Biochemistry, Genetics and Medical 
Applications, Academic Press, Inc., San Diego, California, p.170-181). A variant as 
defined above may be one of these naturally occurring mutants.

A "variant albumin" refers to an albumin protein wherein at one or more positions
there have been amino acid insertions, deletions, or substitutions, either conservative
or non-conservative, provided that such changes result in an albumin protein for
which at least one basic property, for example binding activity (type of and specific
activity e.g. binding to bilirubin), osmolarity (oncotic pressure, colloid osmotic
pressure), behaviour in a certain pH-range (pH-stability) has not significantly been
changed. "Significantly" in this context means that one skilled in the art would say
that the properties of the variant may still be different but would not be unobvious
over the ones of the original protein.

By "conservative substitutions" is intended combinations such as Gly, Ala; Val, Ile,
Leu; Asp, Glu; Asn, Gln; Ser, Thr; Lys, Arg; and Phe, Tyr. Such variants may be
made by techniques well known in the art, such as by site-directed mutagenesis as
disclosed in US Patent No 4,302,386 issued 24 November 1981 to Stevens,
incorporated herein by reference.
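
By way of a hedged illustration, the grouping above can be expressed as a simple lookup. The sketch below assumes one-letter amino acid codes and reads the listed groups as Gly/Ala, Val/Ile/Leu, Asp/Glu, Asn/Gln, Ser/Thr, Lys/Arg and Phe/Tyr; the names CONSERVATIVE_GROUPS and is_conservative are invented for the example.

```python
# Groups of residues treated as conservative replacements for one another,
# written with one-letter codes (G=Gly, A=Ala, V=Val, I=Ile, L=Leu, ...).
CONSERVATIVE_GROUPS = [
    {"G", "A"},
    {"V", "I", "L"},
    {"D", "E"},
    {"N", "Q"},
    {"S", "T"},
    {"K", "R"},
    {"F", "Y"},
]


def is_conservative(original: str, replacement: str) -> bool:
    """True if replacing `original` by `replacement` stays within one group."""
    original, replacement = original.upper(), replacement.upper()
    if original == replacement:
        return False  # identical residues are not a substitution at all
    return any(original in g and replacement in g for g in CONSERVATIVE_GROUPS)


if __name__ == "__main__":
    print(is_conservative("V", "I"))  # Val -> Ile, same group
    print(is_conservative("D", "K"))  # Asp -> Lys, different groups
```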

Typically an albumin variant will have more than 40%, usually at least 50%, more 
typically at least 60%, preferably at least 70%, more preferably at least 80%, yet 
more preferably at least 90%, even more preferably at least 95%, most preferably at
least 98% or more sequence identity with naturally occurring albumin. The percent
sequence identity between two polypeptides may be determined using suitable
computer programs, for example the GAP program of the University of Wisconsin
Genetic Computing Group and it will be appreciated that percent identity is
calculated in relation to polypeptides whose sequence has been aligned optimally.
The alignment may alternatively be carried out using the Clustal W program
(Thompson et al, 1994). The parameters used may be as follows:

Fast pairwise alignment parameters: K-tuple(word) size; 1, window size; 5, gap
penalty; 3, number of top diagonals; 5. Scoring method: x percent. Multiple
alignment parameters: gap open penalty; 10, gap extension penalty; 0.05. Scoring
matrix: BLOSUM.

The term "fragment" as used above includes any fragment of full-length albumin
or a variant thereof, so long as at least one basic property, for example binding
activity (type of and specific activity e.g. binding to bilirubin), osmolarity (oncotic
pressure, colloid osmotic pressure), behaviour in a certain pH-range (pH-stability)
has not significantly been changed. "Significantly" in this context means that one
skilled in the art would say that the properties of the variant may still be different but
would not be unobvious over the ones of the original protein. A fragment will
typically be at least 50 amino acids long. A fragment may comprise at least one
whole sub-domain of albumin. Domains of HSA have been expressed as
recombinant proteins (Dockal, M. et al., 1999, J. Biol. Chem., 274, 29303-29310),
where domain I was defined as consisting of amino acids 1-197, domain II was
defined as consisting of amino acids 189-385 and domain III was defined as
consisting of amino acids 381-585. Partial overlap of the domains occurs because
of the extended α-helix structure (h10-h1) which exists between domains I and II,
and between domains II and III (Peters, 1996, op. cit. Table 2-4). HSA also
comprises six sub-domains (sub-domains IA, IB, IIA, IIB, IIIA and IIIB).
Sub-domain IA comprises amino acids 6-105, sub-domain IB comprises amino acids
120-177, sub-domain IIA comprises amino acids 200-291, sub-domain IIB
comprises amino acids 316-369, sub-domain IIIA comprises amino acids 392-491
and sub-domain IIIB comprises amino acids 512-583. A fragment may comprise
a whole or part of one or more domains or sub-domains as defined above, or any
combination of those domains and/or sub-domains.
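
The sub-domain boundaries given above lend themselves to a small lookup table. The following sketch, offered only as an illustration, reports which whole sub-domains fall inside a fragment described by its first and last residue numbers in mature HSA; the helper name whole_subdomains and the inclusive, 1-based numbering convention are assumptions of the example.

```python
# Sub-domain boundaries of mature HSA as stated in the text
# (inclusive residue numbers).
HSA_SUBDOMAINS = {
    "IA": (6, 105),
    "IB": (120, 177),
    "IIA": (200, 291),
    "IIB": (316, 369),
    "IIIA": (392, 491),
    "IIIB": (512, 583),
}


def whole_subdomains(first_residue: int, last_residue: int) -> list:
    """Names of the sub-domains lying entirely within the given fragment."""
    if first_residue > last_residue:
        raise ValueError("first_residue must not exceed last_residue")
    return [
        name
        for name, (start, end) in HSA_SUBDOMAINS.items()
        if first_residue <= start and end <= last_residue
    ]


if __name__ == "__main__":
    # Domain I as defined by Dockal et al. (amino acids 1-197).
    print(whole_subdomains(1, 197))
    # A hypothetical short fragment that contains no complete sub-domain.
    print(whole_subdomains(100, 150))
```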

In another particularly preferred embodiment the heterologous protein comprises
the sequence of transferrin or a variant or fragment thereof. The term
"transferrin" as used herein includes all members of the transferrin family (Testa,
Proteins of iron metabolism, CRC Press, 2002; Harris & Aisen, Iron carriers and
iron proteins, Vol. 5, Physical Bioinorganic Chemistry, VCH, 1991) and their
derivatives, such as transferrin, mutant transferrins (Mason et al, 1993,
Biochemistry, 32, 5472; Mason et al, 1998, Biochem. J., 330(1), 35), truncated
transferrins, transferrin lobes (Mason et al, 1996, Protein Expr. Purif., 8, 119;
Mason et al, 1991, Protein Expr. Purif., 2, 214), lactoferrin, mutant lactoferrins,
truncated lactoferrins, lactoferrin lobes or fusions of any of the above to other
peptides, polypeptides or proteins (Shin et al, 1995, Proc. Natl. Acad. Sci. USA,
92, 2820; Ali et al, 1999, J. Biol. Chem., 274, 24066; Mason et al, 2002,
Biochemistry, 41, 9448).

The transferrin may be human transferrin. The term "human transferrin" is used
herein to denote material which is indistinguishable from transferrin derived from
a human or which is a variant or fragment thereof. A "variant" includes
insertions, deletions and substitutions, either conservative or non-conservative,
where such changes do not substantially alter the useful ligand-binding or
immunogenic properties of transferrin.

Mutants of transferrin are included in the invention. Such mutants may have 
altered immunogenicity. For example, transferrin mutants may display modified 
(e.g. reduced) glycosylation. The N-linked glycosylation pattern of a transferrin
molecule can be modified by adding/removing amino acid glycosylation
consensus sequences such as N-X-S/T, at any or all of the N, X, or S/T positions.
Transferrin mutants may be altered in their natural binding to metal ions and/or
other proteins, such as transferrin receptor. An example of a transferrin mutant
modified in this manner is exemplified below.
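
As a sketch of how N-X-S/T consensus sequences might be located in a protein sequence before deciding which to add or remove, the following uses a regular expression over one-letter codes. It is an illustration rather than a method taken from this specification: the function name find_sequons is invented, and the exclusion of proline at the X position is an assumption drawn from the commonly cited form of the N-linked consensus, not from the text above.

```python
import re

# N, then any residue except proline (assumption, see lead-in), then S or T.
# The lookahead lets overlapping sites (e.g. "NNSS") all be reported.
_SEQUON = re.compile(r"(?=(N[^P][ST]))")


def find_sequons(sequence: str) -> list:
    """1-based positions of the Asn residue in each N-X-S/T motif."""
    return [m.start() + 1 for m in _SEQUON.finditer(sequence.upper())]


if __name__ == "__main__":
    # Hypothetical toy sequence, not a transferrin sequence.
    print(find_sequons("ANGSANPTNAT"))
```

Substituting the Asn, the S/T residue, or (under the stated assumption) placing Pro at X in a reported motif would remove that site from the list.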

We also include naturally-occurring polymorphic variants of human transferrin or
human transferrin analogues. Generally, variants or fragments of human
transferrin will have at least 5%, 10%, 15%, 20%, 30%, 40% or 50% (preferably
at least 80%, 90% or 95%) of human transferrin's ligand binding activity (for
example iron-binding), weight for weight. The iron binding activity of transferrin
or a test sample can be determined spectrophotometrically by 470nm:280nm
absorbance ratios for the proteins in their iron-free and fully iron-loaded states.
Reagents should be iron-free unless stated otherwise. Iron can be removed from
transferrin or the test sample by dialysis against 0.1M citrate, 0.1M acetate, 10mM
EDTA pH4.5. Protein should be at approximately 20mg/mL in 100mM HEPES,
10mM NaHCO3 pH8.0. Measure the 470nm:280nm absorbance ratio of
apo-transferrin (Calbiochem, CN Biosciences, Nottingham, UK) diluted in water so
that absorbance at 280nm can be accurately determined spectrophotometrically
(0% iron binding). Prepare 20mM iron-nitrilotriacetate (FeNTA) solution by
dissolving 191mg nitrilotriacetic acid in 2mL 1M NaOH, then add 2mL 0.5M ferric
chloride. Dilute to 50mL with deionised water. Fully load apo-transferrin with
iron (100% iron binding) by adding a sufficient excess of freshly prepared 20mM
FeNTA, then dialyse the holo-transferrin preparation completely against 100mM
HEPES, 10mM NaHCO3 pH8.0 to remove remaining FeNTA before measuring
the absorbance ratio at 470nm:280nm. Repeat the procedure using test sample,
which should initially be free from iron, and compare final ratios to the control.
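
The arithmetic behind the assay can be sketched as follows. This is an illustrative calculation only: the molar mass of nitrilotriacetic acid (191.14 g/mol) is supplied as an assumption, the function names are invented, and expressing the comparison with the control as a percentage of the control's apo-to-holo shift in the 470nm:280nm ratio is one plausible reading of "compare final ratios", not a formula stated in the text. The absorbance ratios in the usage example are made-up numbers, not measurements.

```python
NTA_MOLAR_MASS = 191.14  # g/mol for nitrilotriacetic acid (assumed value)


def fenta_concentration_mM(nta_mg=191.0, fecl3_mL=2.0, fecl3_M=0.5, final_mL=50.0):
    """Approximate FeNTA concentration (mM) for the recipe in the text."""
    nta_mmol = nta_mg / NTA_MOLAR_MASS   # mg / (g/mol) = mmol
    fe_mmol = fecl3_mL * fecl3_M         # mL x mol/L = mmol
    limiting_mmol = min(nta_mmol, fe_mmol)
    return limiting_mmol / final_mL * 1000.0  # mmol/mL -> mmol/L (mM)


def relative_iron_binding(sample_apo, sample_holo, control_apo, control_holo):
    """Sample's apo-to-holo ratio shift as a percentage of the control's.

    Each argument is a 470nm:280nm absorbance ratio.
    """
    control_shift = control_holo - control_apo
    if control_shift <= 0:
        raise ValueError("control holo ratio must exceed control apo ratio")
    return 100.0 * (sample_holo - sample_apo) / control_shift


if __name__ == "__main__":
    print(round(fenta_concentration_mM(), 1))
    # Made-up ratios purely to show the calculation.
    print(round(relative_iron_binding(0.002, 0.024, 0.002, 0.046), 1))
```

With the quantities given above (191mg nitrilotriacetic acid, 2mL of 0.5M ferric chloride, 50mL final volume) the first function returns approximately 20 mM, which agrees with the stated FeNTA concentration.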

Additionally, single or multiple heterologous fusions comprising any of the above;
or single or multiple heterologous fusions to albumin, transferrin or
immunoglobulins or a variant or fragment of any of these may be used. Such
fusions include albumin N-terminal fusions, albumin C-terminal fusions and
co-N-terminal and C-terminal albumin fusions as exemplified by WO 01/79271, and
transferrin N-terminal fusions, transferrin C-terminal fusions, and co-N-terminal
and C-terminal transferrin fusions.

Examples of transferrin fusions are given in US patent applications 
US2003/0221201 and US2003/0226155, Shin, et al., 1995, Proc Natl Acad Sci U
S A, 92, 2820, Ali, et al., 1999, J Biol Chem, 274, 24066, Mason, et al, 2002,
Biochemistry, 41, 9448, the contents of which are incorporated herein by
reference. 

The skilled person will also appreciate that the open reading frame of any other
gene or variant, or part of either, can be utilised as an open reading frame for use
with the present invention. For example, the open reading frame may encode a
protein comprising any sequence, be it a natural protein (including a zymogen), or
a variant, or a fragment (which may, for example, be a domain) of a natural
protein; or a totally synthetic protein; or a single or multiple fusion of different
proteins (natural or synthetic). Such proteins can be taken, but not exclusively,
from the lists provided in WO 01/79258, WO 01/79271, WO 01/79442, WO
01/79443, WO 01/79444 and WO 01/79480, or a variant or fragment thereof; the
disclosures of which are incorporated herein by reference. Although these patent
applications present the list of proteins in the context of fusion partners for
albumin, the present invention is not so limited and, for the purposes of the
present invention, any of the proteins listed therein may be presented alone or as
fusion partners for albumin, the Fc region of immunoglobulin, transferrin,
lactoferrin or any other protein or fragment or variant of any of the above, as a
desired polypeptide.

The heterologous protein may be a therapeutically active protein. In other words, 
it may have a recognised medical effect on individuals, such as humans. Many 
different types of therapeutically active protein are well known in the art. 

The heterologous protein may comprise a leader sequence effective to cause
secretion in yeast. 

Numerous natural or artificial polypeptide signal sequences (also called secretion 
pre regions) have been used or developed for secreting proteins from host cells.
The signal sequence directs the nascent protein towards the machinery of the cell
that exports proteins from the cell into the surrounding medium or, in some cases,
into the periplasmic space. The signal sequence is usually, although not
necessarily, located at the N-terminus of the primary translation product and is
generally, although not necessarily, cleaved off the protein during the secretion
process, to yield the "mature" protein. 

In the case of some proteins the entity that is initially secreted, after the removal
of the signal sequence, includes additional amino acids at its N-terminus called a
"pro" sequence, the intermediate entity being called a "pro-protein". These pro
sequences may assist the final protein to fold and become functional, and are
usually then cleaved off. In other instances, the pro region simply provides a
cleavage site for an enzyme to cleave off the pre-pro region and is not known to
have another function.

The pro sequence can be removed either during the secretion of the protein from
the cell or after export from the cell into the surrounding medium or periplasmic
space.

Polypeptide sequences which direct the secretion of proteins, whether they
resemble signal (i.e. pre) sequences or pre-pro secretion sequences, are referred to
as leader sequences. The secretion of proteins is a dynamic process involving
translation, translocation and post-translational processing, and one or more of
these steps may not necessarily be completed before another is either initiated or
completed.

For production of proteins in eukaryotic species such as the yeasts Saccharomyces
cerevisiae, Zygosaccharomyces species, Kluyveromyces lactis and Pichia pastoris,
known leader sequences include those from the S. cerevisiae acid phosphatase
protein (Pho5p) (see EP 366 400), the invertase protein (Suc2p) (see Smith et al
(1985) Science, 229, 1219-1224) and heat-shock protein-150 (Hsp150p) (see WO
95/33833). Additionally, leader sequences from the S. cerevisiae mating factor
alpha-1 protein (MFα-1) and from the human lysozyme and human serum
albumin (HSA) protein have been used, the latter having been used especially,
although not exclusively, for secreting human albumin. WO 90/01063 discloses a
fusion of the MFα-1 and HSA leader sequences, which advantageously reduces
the production of a contaminating fragment of human albumin relative to the use
of the MFα-1 leader sequence. Modified leader sequences are also disclosed in
the examples of this application and the reader will appreciate that those leader
sequences can be used with proteins other than transferrin. In addition, the natural
transferrin leader sequence may be used to direct secretion of transferrin and other
heterologous proteins.

Where the chaperone is protein disulphide isomerase, then preferably the 
heterologous protein comprises disulphide bonds in its mature form. The 
disulphide bonds may be intramolecular and/or intermolecular.

The heterologous protein may be a commercially useful protein. Some
heterologously expressed proteins are intended to interact with the cell in which
they are expressed in order to bring about a beneficial effect on the cell's
activities. These proteins are not, in their own right, commercially useful.
Commercially useful proteins are proteins that have a utility ex vivo of the cell in
which they are expressed. Nevertheless, the skilled reader will appreciate that a
commercially useful protein may also have a biological effect on the host cell
expressing it as a heterologous protein, but that that effect is not the main or sole
reason for expressing the protein therein.

In one embodiment it is preferred that the heterologous protein is not β-lactamase.
In another embodiment it is preferred that the heterologous protein is not
antistasin. However, the reader will appreciate that neither of these provisos
excludes genes encoding either β-lactamase or antistasin from being present on the
2μm-family plasmid of the invention, merely that the gene encoding the
heterologous protein encodes a protein other than β-lactamase and/or antistasin.

Plasmids can be prepared by modifying 2μm-family plasmids known in the art by
inserting a gene encoding a chaperone and inserting a gene encoding a heterologous
protein using techniques well known in the art such as are described by Sambrook
et al, Molecular Cloning: A Laboratory Manual, 2001, 3rd edition, the contents
of which are incorporated herein by reference. For example, one such method
involves ligation via cohesive ends. Compatible cohesive ends can be generated on a
DNA fragment for insertion and plasmid by the action of suitable restriction
enzymes. These ends will rapidly anneal through complementary base pairing and
remaining nicks can be closed by the action of DNA ligase.
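
The notion of compatible cohesive ends can be illustrated with a short check on the single-stranded overhangs. This sketch assumes both overhangs have the same polarity (for example both 5' overhangs) and are written 5' to 3'; under that assumption two ends can anneal when one overhang is the reverse complement of the other. The function names are invented for the example, and the check says nothing about whether the restriction site is regenerated after ligation.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA sequence written 5' to 3'."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def ends_compatible(overhang_a: str, overhang_b: str) -> bool:
    """True if two same-polarity overhangs (5' to 3') can base-pair fully."""
    return overhang_a.upper() == reverse_complement(overhang_b)


if __name__ == "__main__":
    # EcoRI leaves a 5' AATT overhang, which pairs with another EcoRI end.
    print(ends_compatible("AATT", "AATT"))
    # BamHI and BglII both leave 5' GATC overhangs, so their ends can anneal.
    print(ends_compatible("GATC", "GATC"))
    # An EcoRI end and a BamHI end cannot anneal.
    print(ends_compatible("AATT", "GATC"))
```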

A further method uses synthetic double stranded oligonucleotide linkers and
adaptors. DNA fragments with blunt ends are generated by bacteriophage T4 DNA
polymerase or E. coli DNA polymerase I which remove protruding 3' termini and fill
in recessed 3' ends. Synthetic linkers and pieces of blunt-ended double-stranded
DNA, which contain recognition sequences for defined restriction enzymes, can be
ligated to blunt-ended DNA fragments by T4 DNA ligase. They are subsequently
digested with appropriate restriction enzymes to create cohesive ends and ligated to
an expression vector with compatible termini. Adaptors are also chemically
synthesised DNA fragments which contain one blunt end used for ligation but which
also possess one preformed cohesive end. Alternatively a DNA fragment or DNA
fragments can be ligated together by the action of DNA ligase in the presence or
absence of one or more synthetic double stranded oligonucleotides optionally
containing cohesive ends.

Synthetic linkers containing a variety of restriction endonuclease sites are
commercially available from a number of sources including Sigma-Genosys Ltd,
London Road, Pampisford, Cambridge, United Kingdom.

Appropriate insertion sites in 2μm-family plasmids include, but are not limited to,
those discussed above.

The present invention also provides a host cell comprising a plasmid as defined
above. The host cell may be any type of cell. Bacterial and yeast host cells are
preferred. Bacterial host cells may be useful for cloning purposes. Yeast host
cells may be useful for expression of genes present in the plasmid.

In one embodiment the host cell is a yeast cell, such as a member of the
Saccharomyces, Kluyveromyces, or Pichia genus. Saccharomyces cerevisiae,
Kluyveromyces lactis, Pichia pastoris and Pichia membranaefaciens, or
Zygosaccharomyces rouxii, Zygosaccharomyces bailii, Zygosaccharomyces
fermentati, or Kluyveromyces drosophilarum are preferred.

The host cell type may be selected for compatibility with the plasmid type being
used. Plasmids obtained from one yeast type can be maintained in other yeast
types (Irie et al, 1991, Gene, 108(1), 139-144; Irie et al, 1991, Mol. Gen. Genet.,
225(2), 257-265). For example, pSR1 from Zygosaccharomyces rouxii can be
maintained in Saccharomyces cerevisiae. Preferably, the host cell is compatible
with the 2μm-family plasmid used (see below for a full description of the
following plasmids). For example, where the plasmid is based on pSR1, pSB3 or
pSB4 then a suitable yeast cell is Zygosaccharomyces rouxii; where the plasmid is
based on pSB1 or pSB2 then a suitable yeast cell is Zygosaccharomyces bailii;
where the plasmid is based on pSM1 then a suitable yeast cell is
Zygosaccharomyces fermentati; where the plasmid is based on pKD1 then a
suitable yeast cell is Kluyveromyces drosophilarum; where the plasmid is based on
pPM1 then a suitable yeast cell is Pichia membranaefaciens; where the plasmid is
based on the 2μm plasmid then a suitable yeast cell is Saccharomyces cerevisiae
or Saccharomyces carlsbergensis. It is particularly preferred that the plasmid is
based on the 2μm plasmid and the yeast cell is Saccharomyces cerevisiae.
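
The plasmid-to-host pairings listed above can be captured as a lookup table, for instance when checking a planned construct against an intended host. The sketch below is illustrative only; the dictionary name, the helper suitable_hosts and the use of the key "2um" for the 2μm plasmid are choices made for the example, and the table records only the pairings named in the preceding paragraph (other hosts may also maintain a given plasmid, as noted for pSR1).

```python
# Suitable yeast hosts for each 2um-family plasmid, as listed in the text.
SUITABLE_HOSTS = {
    "pSR1": ["Zygosaccharomyces rouxii"],
    "pSB3": ["Zygosaccharomyces rouxii"],
    "pSB4": ["Zygosaccharomyces rouxii"],
    "pSB1": ["Zygosaccharomyces bailii"],
    "pSB2": ["Zygosaccharomyces bailii"],
    "pSM1": ["Zygosaccharomyces fermentati"],
    "pKD1": ["Kluyveromyces drosophilarum"],
    "pPM1": ["Pichia membranaefaciens"],
    "2um": ["Saccharomyces cerevisiae", "Saccharomyces carlsbergensis"],
}


def suitable_hosts(base_plasmid: str) -> list:
    """Hosts named in the text for a plasmid based on `base_plasmid`."""
    return list(SUITABLE_HOSTS.get(base_plasmid, []))


if __name__ == "__main__":
    print(suitable_hosts("pKD1"))
    print(suitable_hosts("2um"))
```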

A 2μm-family plasmid of the invention can be said to be "based on" a naturally
occurring plasmid if it comprises one, two or preferably three of the genes FLP,
REP1 and REP2 having sequences derived from that naturally occurring plasmid.

It may be particularly advantageous to use a yeast deficient in one or more protein
mannosyl transferases involved in O-glycosylation of proteins, for instance by
disruption of the gene coding sequence.

Recombinantly expressed proteins can be subject to undesirable post-translational
modifications by the producing host cell. For example, the albumin protein
sequence does not contain any sites for N-linked glycosylation and has not been
reported to be modified, in nature, by O-linked glycosylation. However, it has
been found that recombinant human albumin ("rHA") produced in a number of
yeast species can be modified by O-linked glycosylation, generally involving
mannose. The mannosylated albumin is able to bind to the lectin Concanavalin A.
The amount of mannosylated albumin produced by the yeast can be reduced by
using a yeast strain deficient in one or more of the PMT genes (WO 94/04687).
The most convenient way of achieving this is to create a yeast which has a defect
in its genome such that a reduced level of one of the Pmt proteins is produced.
For example, there may be a deletion, insertion or transposition in the coding
sequence or the regulatory regions (or in another gene regulating the expression of
one of the PMT genes) such that little or no Pmt protein is produced.
Alternatively, the yeast could be transformed to produce an anti-Pmt agent, such
as an anti-Pmt antibody.

If a yeast other than S. cerevisiae is used, disruption of one or more of the genes
equivalent to the PMT genes of S. cerevisiae is also beneficial, e.g. in Pichia
pastoris or Kluyveromyces lactis. The sequence of PMT1 (or any other PMT
gene) isolated from S. cerevisiae may be used for the identification or disruption
of genes encoding similar enzymatic activities in other fungal species. The
cloning of the PMT1 homologue of Kluyveromyces lactis is described in WO
94/04687.

The yeast will advantageously have a deletion of the HSP150 and/or YAP3 genes
as taught respectively in WO 95/33833 and WO 95/23857.

A plasmid as defined above may be introduced into a host through standard
techniques. With regard to transformation of prokaryotic host cells, see, for
example, Cohen et al (1972) Proc. Natl. Acad. Sci. USA 69, 2110 and Sambrook et
al (2001) Molecular Cloning, A Laboratory Manual, 3rd Ed., Cold Spring Harbor
Laboratory, Cold Spring Harbor, NY. Transformation of yeast cells is described in
Sherman et al (1986) Methods In Yeast Genetics, A Laboratory Manual, Cold Spring
Harbor, NY. The method of Beggs (1978) Nature 275, 104-109 is also useful.
Methods for the transformation of S. cerevisiae are taught generally in EP 251
744, EP 258 067 and WO 90/01063, all of which are incorporated herein by
reference. With regard to vertebrate cells, reagents useful in transfecting such cells,
for example calcium phosphate and DEAE-dextran or liposome formulations, are
available from Stratagene Cloning Systems, or Life Technologies Inc., Gaithersburg,
MD 20877, USA.

Electroporation is also useful for transforming cells and is well known in the art for
transforming yeast cells, bacterial cells and vertebrate cells. Methods for
transformation of yeast by electroporation are disclosed in Becker & Guarente
(1990) Methods Enzymol. 194, 182.

Generally, the plasmid will transform not all of the hosts and it will therefore be
necessary to select for transformed host cells. Thus, a plasmid may comprise a
selectable marker, including but not limited to a bacterial selectable marker and/or a
yeast selectable marker. A typical bacterial selectable marker is the β-lactamase
gene although many others are known in the art. Typical yeast selectable markers
include LEU2, TRP1, HIS3, HIS4, URA3, URA5, SFA1, ADE2, MET15, LYS5,
LYS2, ILV2, FBA1, PSE1, PDI1 and PGK1. Those skilled in the art will
appreciate that any gene whose chromosomal deletion or inactivation results in an
inviable host, so called essential genes, can be used as a selective marker if a
functional gene is provided on the plasmid, as demonstrated for PGK1 in a pgk1
yeast strain (Piper and Curran, 1990, Curr. Genet. 17, 119). Suitable essential
genes can be found within the Stanford Genome Database (SGD,
http://db.yeastgenome.org). Any essential gene product (e.g. PDI1, PSE1, PGK1
or FBA1) which, when deleted or inactivated, does not result in an auxotrophic
(biosynthetic) requirement, can be used as a selectable marker on a plasmid in a
host cell that, in the absence of the plasmid, is unable to produce that gene
product, to achieve increased plasmid stability without the disadvantage of
requiring the cell to be cultured under specific selective conditions. By
"auxotrophic (biosynthetic) requirement" we include a deficiency which can be
complemented by additions or modifications to the growth medium. Therefore,
preferred "essential marker genes" in the context of the present invention are those
that, when deleted or inactivated in a host cell, result in a deficiency which cannot
be complemented by additions or modifications to the growth medium.

Additionally, a plasmid according to any one of the first, second or third aspects
of the present invention may comprise more than one selectable marker.

One selection technique involves incorporating into the expression vector a DNA
sequence marker, with any necessary control elements, that codes for a selectable
trait in the transformed cell. These markers include dihydrofolate reductase, G418 or
neomycin resistance for eukaryotic cell culture, and tetracycline, kanamycin or
ampicillin (i.e. β-lactamase) resistance genes for culturing in E. coli and other
bacteria. Alternatively, the gene for such selectable trait can be on another vector,
which is used to co-transform the desired host cell.

Another method of identifying successfully transformed cells involves growing the
cells resulting from the introduction of a plasmid of the invention, optionally to allow
the expression of a recombinant polypeptide (i.e. a polypeptide which is encoded by
a polynucleotide sequence on the plasmid and is heterologous to the host cell, in the
sense that that polypeptide is not naturally produced by the host). Cells can be
harvested and lysed and their DNA or RNA content examined for the presence of the
recombinant sequence using a method such as that described by Southern (1975) J.
Mol. Biol. 98, 503 or Berent et al (1985) Biotech 3, 208 or other methods of DNA
and RNA analysis common in the art. Alternatively, the presence of a polypeptide in
the supernatant of a culture of a transformed cell can be detected using antibodies.

In addition to directly assaying for the presence of recombinant DNA, successful
transformation can be confirmed by well known immunological methods when the
recombinant DNA is capable of directing the expression of the protein. For example,
cells successfully transformed with an expression vector produce proteins displaying
appropriate antigenicity. Samples of cells suspected of being transformed are
harvested and assayed for the protein using suitable antibodies.

Thus, in addition to the transformed host cells themselves, the present invention also
contemplates a culture of those cells, preferably a monoclonal (clonally
homogeneous) culture, or a culture derived from a monoclonal culture, in a nutrient
medium. Alternatively, transformed cells may represent an
industrially/commercially or pharmaceutically useful product and can be used
without further purification or can be purified from a culture medium and optionally
formulated with a carrier or diluent in a manner appropriate to their intended
industrial/commercial or pharmaceutical use, and optionally packaged and presented
in a manner suitable for that use. For example, whole cells could be immobilised; or
used to spray a cell culture directly on to/into a process, crop or other desired target.
Similarly, whole cells, such as yeast cells, can be used as capsules for a huge variety of
applications, such as fragrances, flavours and pharmaceuticals.

Transformed host cells may be cultured for a sufficient time and under appropriate
conditions known to those skilled in the art, and in view of the teachings disclosed 
herein, to permit the expression of the chaperone and heterologous protein 
encoded by the plasmid. 

The culture medium may be non-selective or place a selective pressure on the
maintenance of the plasmid. 

The thus produced heterologous protein may be present intracellularly or, if
secreted, in the culture medium and/or periplasmic space of the host cell.

The step of "purifying the thus expressed heterologous protein from the cultured
host cell or the culture medium" optionally comprises cell immobilization, cell
separation and/or cell breakage, but always comprises at least one other
purification step different from the step or steps of cell immobilization, separation
and/or breakage.

Cell immobilization techniques, such as encasing the cells using calcium alginate
beads, are well known in the art. Similarly, cell separation techniques, such as
centrifugation, filtration (e.g. cross-flow filtration), expanded bed chromatography
and the like are well known in the art. Likewise, methods of cell breakage,
including beadmilling, sonication, enzymatic exposure and the like are well
known in the art.

The at least one other purification step may be any other step suitable for protein
purification known in the art. For example purification techniques for the
recovery of recombinantly expressed albumin have been disclosed in: WO
92/04367, removal of matrix-derived dye; EP 464 590, removal of yeast-derived
colorants; EP 319 067, alkaline precipitation and subsequent application of the
albumin to a lipophilic phase; and WO 96/37515, US 5 728 553 and WO
00/44772, which describe complete purification processes; all of which are

Proteins other than albumin may be purified from the culture medium by any
technique that has been found to be useful for purifying such proteins.

Suitable methods include ammonium sulphate or ethanol precipitation, acid or
solvent extraction, anion or cation exchange chromatography, phosphocellulose
chromatography, hydrophobic interaction chromatography, affinity chromatography,
hydroxylapatite chromatography, lectin chromatography, concentration, dilution, pH
adjustment, diafiltration, ultrafiltration, high performance liquid chromatography
("HPLC"), reverse phase HPLC, conductivity adjustment and the like.

In one embodiment, any one or more of the above mentioned techniques may be
used to further purify the thus isolated protein to a commercially or industrially
acceptable level of purity. By commercially or industrially acceptable level of
purity, we include the provision of the protein at a concentration of at least 0.01
g.L⁻¹, 0.02 g.L⁻¹, 0.03 g.L⁻¹, 0.04 g.L⁻¹, 0.05 g.L⁻¹, 0.06 g.L⁻¹, 0.07 g.L⁻¹, 0.08 g.L⁻¹,
0.09 g.L⁻¹, 0.1 g.L⁻¹, 0.2 g.L⁻¹, 0.3 g.L⁻¹, 0.4 g.L⁻¹, 0.5 g.L⁻¹, 0.6 g.L⁻¹, 0.7 g.L⁻¹, 0.8
g.L⁻¹, 0.9 g.L⁻¹, 1 g.L⁻¹, 2 g.L⁻¹, 3 g.L⁻¹, 4 g.L⁻¹, 5 g.L⁻¹, 6 g.L⁻¹, 7 g.L⁻¹, 8 g.L⁻¹, 9 g.L⁻¹,
10 g.L⁻¹, 15 g.L⁻¹, 20 g.L⁻¹, 25 g.L⁻¹, 30 g.L⁻¹, 40 g.L⁻¹, 50 g.L⁻¹, 60 g.L⁻¹, 70 g.L⁻¹,
70 g.L⁻¹, 90 g.L⁻¹, 100 g.L⁻¹, 150 g.L⁻¹, 200 g.L⁻¹, 250 g.L⁻¹, 300 g.L⁻¹, 350 g.L⁻¹, 400
g.L⁻¹, 500 g.L⁻¹, 600 g.L⁻¹, 700 g.L⁻¹, 800 g.L⁻¹, 900 g.L⁻¹, 1000 g.L⁻¹, or more.

It is preferred that the heterologous protein is purified to achieve a pharmaceutically
acceptable level of purity. A protein has a pharmaceutically acceptable level of
purity if it is essentially pyrogen free and can be administered in a
pharmaceutically efficacious amount without causing medical effects not
associated with the activity of the protein.

The resulting heterologous protein may be used for any of its known utilities,
which, in the case of albumin, include i.v. administration to patients to treat severe
burns, shock and blood loss, supplementing culture media, and as an excipient in
formulations of other proteins.

Although it is possible for a therapeutically useful heterologous protein obtained by
a process of the invention to be administered alone, it is preferable to present it
as a pharmaceutical formulation, together with one or more acceptable carriers or
diluents. The carrier(s) or diluent(s) must be "acceptable" in the sense of being
compatible with the desired protein and not deleterious to the recipients thereof.

Typically, the carriers or diluents will be water or saline which will be sterile and
pyrogen free.

Optionally the thus formulated protein will be presented in a unit dosage form, 
such as in the form of a tablet, capsule, injectable solution or the like. 

A further embodiment of the present invention provides a host cell recombinantly
encoding proteins comprising the sequences of PDI and transferrin-based proteins.
By "transferrin-based protein" we mean transferrin or any other member of the
transferrin family (e.g. lactoferrin), a variant or fragment thereof or a fusion
protein comprising transferrin, a variant or fragment thereof, including the types
described above. Thus the present invention also provides for the use of a
recombinant PDI gene to increase the expression of a transferrin-based protein.

The PDI gene may be provided on a plasmid, such as a 2µm-family plasmid as
described above. Alternatively, the PDI gene may be chromosomally integrated.
In a preferred embodiment, the PDI gene is chromosomally integrated at the locus
of an endogenously encoded PDI gene, preferably without disrupting the
expression of the endogenous PDI gene. In this context, "without disrupting the
expression of the endogenous PDI gene" means that, although some decrease in
the protein production from the endogenous PDI gene as a result of the integration
may be acceptable (and preferably there is no decrease), the total level of PDI
protein production in the modified host cell as a result of the combined effect of
expression from the endogenous and integrated PDI genes is increased, relative to
the level of PDI protein production by the host cell prior to the integration event.

The gene encoding the transferrin-based protein may be provided on a plasmid,
such as a 2µm-family plasmid as described above, or may be chromosomally
integrated, such as at the locus of an endogenously encoded PDI gene, preferably
without disrupting the expression of the endogenous PDI gene.

In one embodiment the PDI gene is chromosomally integrated and the gene
encoding the transferrin-based protein is provided on a plasmid. In another
embodiment, the PDI gene is provided on a plasmid and the gene encoding the
transferrin-based protein is chromosomally integrated. In another embodiment
both the PDI gene and the gene encoding the transferrin-based protein are
chromosomally integrated. In another embodiment both the PDI gene and the
gene encoding the transferrin-based protein are provided on a plasmid.

As discussed above, Bao et al, 2000, Yeast, 16, 329-341 reported that
over-expression of the K. lactis PDI gene KlPDI1 was toxic to K. lactis cells. Against
this background we have surprisingly found that, not only is it possible to
over-express PDI and other chaperones without the detrimental effects reported in Bao
et al, but that two different chaperones can be recombinantly over-expressed in the
same cell and, rather than being toxic, can increase the expression of heterologous
proteins to levels higher than the levels obtained by individual expression of either
of the chaperones. This was not expected. On the contrary, in light of the
teaching of Bao et al, one would think that over-expression of two chaperones
would be even more toxic than the over-expression of one. Moreover, in light of
the earlier findings of the present invention, it was expected that the increases in
heterologous protein expression obtained by co-expression with a single
chaperone would be at the maximum level possible for the cell system used.
Therefore, it was particularly surprising to find that yet further increases in
heterologous protein expression could be obtained by co-expression of two
different chaperones with the heterologous protein.

Accordingly, as a fifth aspect of the present invention there is provided a method
for producing heterologous protein comprising providing a host cell (such as
defined above) comprising a first recombinant gene encoding a protein comprising
the sequence of a first chaperone protein, a second recombinant gene encoding a
protein comprising the sequence of a second chaperone protein and a third
recombinant gene encoding a heterologous protein, wherein the first and second
chaperones are different; culturing the host cell in a culture medium under
conditions that allow the expression of the first, second and third genes; and
optionally purifying the thus expressed heterologous protein from the cultured
host cell or the culture medium; and further optionally, lyophilising the thus
purified protein.

The method may further comprise the step of formulating the purified
heterologous protein with a carrier or diluent and optionally presenting the thus
formulated protein in a unit dosage form, in the manner discussed above.

The term "recombinant gene" includes nucleic acid sequences that operate
independently as "stand alone" expressible sequences to produce an encoded
protein or, in the alternative, nucleic acid sequences introduced that operate in
combination with endogenous sequences (such as by integration into an
endogenous sequence so as to produce a nucleic acid sequence that is different to
the endogenous sequence) within the host to cause increased expression of a target
protein.

The first and second chaperones may be a chaperone as discussed above, and are a
combination of chaperones that, when co-expressed in the same host cell, provide
an additive effect to the increase in expression of the heterologous protein. By
"additive effect" we include the meaning that the level of expression of the
heterologous protein in the host cell is higher when the first and second
recombinant genes are simultaneously co-expressed with the third recombinant
gene as compared to the same system wherein (i) the first recombinant gene is
co-expressed with the third recombinant gene in the absence of the expression of the
second recombinant gene and (ii) the second recombinant gene is co-expressed
with the third recombinant gene in the absence of the expression of the first
recombinant gene.
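
The comparison in this definition can be sketched as a simple predicate. The Python fragment below is only an illustration: the function name and the numeric expression levels are hypothetical and are not taken from the present application.

```python
def is_additive(both: float, first_only: float, second_only: float) -> bool:
    """Return True if co-expressing both chaperone genes with the third gene
    gives a higher heterologous protein level than either comparison system:
    (i) first chaperone gene only, (ii) second chaperone gene only."""
    return both > first_only and both > second_only


# Hypothetical expression levels in arbitrary units (not measured data).
print(is_additive(both=3.0, first_only=2.0, second_only=1.5))
print(is_additive(both=2.0, first_only=2.0, second_only=1.5))
```

The predicate only requires the two-chaperone level to exceed each single-chaperone level; it does not require the increases to sum arithmetically.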

One preferred chaperone is protein disulphide isomerase. Another preferred 
chaperone is ORM2 or a fragment or variant thereof. In a particularly preferred 
embodiment, the first and second chaperones are protein disulphide isomerase and 
ORM2 or a fragment or variant thereof. 

The first, second and third recombinant genes may each individually be present on
a plasmid within the host cell (such as a 2µm-family plasmid, as discussed above)
or be chromosomally integrated within the genome of the host cell. It will be
appreciated that any combination of plasmid and chromosomally integrated first,
second and third recombinant genes may be used. For example, the first, second
and third recombinant genes may each individually be present on a plasmid, and
this may be either the same plasmid or different plasmids. Alternatively, the first
recombinant gene may be present on a plasmid, and second and third recombinant
genes may be chromosomally integrated within the genome of the host cell.
Alternatively, the first and second recombinant genes may be present on a plasmid
and the third recombinant gene may be chromosomally integrated within the
genome of the host cell. Alternatively, the first and third recombinant genes may
be present on a plasmid and the second recombinant gene may be chromosomally
integrated within the genome of the host cell. Alternatively, the first and second
recombinant genes may be chromosomally integrated within the genome of the
host cell and the third recombinant gene may be present on a plasmid.
Alternatively, the first, second and third recombinant genes may each individually
be chromosomally integrated within the genome of the host cell.
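
Since each of the three recombinant genes may independently be plasmid-borne or chromosomally integrated, the possible arrangements can be enumerated mechanically. The following Python sketch is an illustration only; the labels are hypothetical, and it does not distinguish between genes carried on the same plasmid and genes carried on different plasmids.

```python
from itertools import product

GENES = ("first", "second", "third")
LOCATIONS = ("plasmid", "chromosome")


def location_combinations():
    """Yield every assignment of the three recombinant genes to a location."""
    for combo in product(LOCATIONS, repeat=len(GENES)):
        yield dict(zip(GENES, combo))


combos = list(location_combinations())
for combo in combos:
    print(combo)
```

Two locations for each of three genes gives eight arrangements, which include the six arrangements recited in the preceding paragraph.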

Particularly preferred plasmids are those defined above in respect of earlier
aspects of the present invention. Accordingly, the present invention also provides
a plasmid as defined above wherein the plasmid comprises two different genes
(the first and second recombinant genes) encoding different chaperones. In one
preferred embodiment, the plasmid may further comprise a gene encoding a
heterologous protein (the third recombinant gene), such as a heterologous protein
as described above.

In a sixth aspect of the present invention there is provided a method for producing
a heterologous protein, such as a heterologous protein as defined above for an
earlier aspect of the present invention, comprising: providing a host cell
comprising a first recombinant gene encoding the protein comprising the sequence
of ORM2 or a variant thereof and a second recombinant gene encoding a
heterologous protein; culturing the host cell in a culture medium under conditions
that allow the expression of the first and second genes; and purifying the thus
expressed heterologous protein from the cultured host cell or the culture medium;
and optionally, lyophilising the thus purified protein; and optionally formulating
the purified heterologous protein with a carrier or diluent; and optionally
presenting the thus formulated protein in a unit dosage form.

In the manner discussed above, the host cell may further comprise a further
recombinant gene encoding a protein comprising the sequence of an alternative
chaperone to ORM2 or a variant thereof.

Either or both of the first and second recombinant genes may be expressed from a
plasmid, and preferably from the same plasmid. A further recombinant gene
encoding a protein comprising the sequence of an alternative chaperone to ORM2
or a variant thereof may also be expressed from a plasmid, preferably from the
same plasmid as either or both of the first and second recombinant genes. The
plasmid may be a 2µm-family plasmid, such as the 2µm plasmid.

The present invention also provides, in a seventh aspect, for the use of a nucleic
acid sequence encoding the protein ORM2 or a variant thereof to increase the
production, in a host cell, of a heterologous protein encoded by a recombinant
gene in the host cell by co-expression of the nucleic acid sequence and the
recombinant gene within the host cell. Either or both of the nucleic acid sequence
and the recombinant gene encoding the heterologous protein may be expressed
from a plasmid within the host cell, and preferably from the same plasmid. In the
manner discussed above, the host cell may further comprise a recombinant gene
encoding an alternative chaperone to ORM2 or a variant thereof, which may be
located on a plasmid within the host cell, preferably on the same plasmid as either
or both of the nucleic acid sequence and the recombinant gene encoding the
heterologous protein. Suitable plasmids include a 2µm-family plasmid, such as
the 2µm plasmid, as discussed above.

In an eighth aspect of the present invention there is also provided the use of a
plasmid as an expression vector to increase the production of a heterologous
protein by providing a recombinant gene encoding the heterologous protein and a
gene encoding ORM2 or a variant thereof on the same plasmid. The plasmid may
further comprise a gene encoding an alternative chaperone to ORM2 or a variant
thereof in the manner discussed above. Suitable plasmids include a 2µm-family
plasmid, such as the 2µm plasmid, as discussed above.

Accordingly, in a ninth aspect, the present invention also provides a plasmid,
preferably an expression plasmid, comprising a first gene encoding the protein
ORM2 or a variant or fragment thereof and a second gene encoding a
heterologous protein, as discussed above. The plasmid may further comprise a
third gene encoding an alternative chaperone to ORM2 or a variant thereof. In a
preferred embodiment, the third gene encodes a protein comprising the sequence
of protein disulphide isomerase.

We have also demonstrated that a plasmid-borne gene encoding a protein
comprising the sequence of an "essential" chaperone, such as PDI, can be used to
stably maintain the plasmid in a host cell that, in the absence of the plasmid, does
not produce the chaperone, and simultaneously increase the expression of a
heterologous protein encoded by a recombinant gene within the host cell. This
system is advantageous because it allows the user to minimise the number of
recombinant genes that need to be carried by a plasmid. For example, typical
prior art plasmids carry marker genes (such as those described above) that
enable the plasmid to be stably maintained during the host cell culturing process.
Such marker genes need to be retained on the plasmid in addition to any further
genes that are required to achieve a desired effect. However, the ability of
plasmids to incorporate exogenous DNA sequences is limited and it is therefore
advantageous to minimise the number of sequence insertions required to achieve a
desired effect. Moreover, some marker genes (such as auxotrophic marker genes)
require the culturing process to be conducted under specific conditions in order to
obtain the effect of the marker gene. Such specific conditions may not be optimal
for cell growth or protein production, or may require inefficient or unduly
expensive growth systems to be used.

For the purpose of increasing heterologous gene expression, we have found that it
is possible to use a gene that recombinantly encodes a protein comprising the
sequence of an "essential" chaperone for the dual purpose of increasing the
production of a heterologous protein in a host cell and in the role of a selectable
marker on a plasmid, where the plasmid is present within a cell that, in the
absence of the plasmid, is unable to produce the chaperone. This system has the
advantage that it minimises the number of recombinant genes that need to be
carried by the plasmid. The system also has the advantage that the host cell can be
cultured under conditions that do not have to be adapted for any particular marker
gene, without losing plasmid stability. For example, host cells produced using
this system can be cultured in rich media, which may be more economical than the
minimal media that are commonly used to give auxotrophic marker genes their
effect.

Accordingly, in a tenth aspect, the present invention also provides a host cell
comprising a plasmid, the plasmid comprising a gene that encodes an essential
chaperone wherein, in the absence of the plasmid, the host cell is unable to
produce the chaperone. Preferably, in the absence of the plasmid, the host cell is
inviable. The host cell may further comprise a recombinant gene encoding a
heterologous protein, such as those described above in respect of earlier aspects of
the invention.

The present invention also provides, in an eleventh aspect, a plasmid comprising, as
the sole selectable marker, a gene encoding an essential chaperone. The plasmid
may further comprise a gene encoding a heterologous protein. The plasmid may
be a 2µm-family plasmid.

The present invention also provides, in a twelfth aspect, a method for producing a
heterologous protein comprising the steps of: providing a host cell comprising a
plasmid, the plasmid comprising a gene that encodes an essential chaperone
wherein, in the absence of the plasmid, the host cell is unable to produce the
chaperone and wherein the host cell further comprises a recombinant gene
encoding a heterologous protein; culturing the host cell in a culture medium under
conditions that allow the expression of the essential chaperone and the
heterologous protein; and optionally purifying the thus expressed heterologous
protein from the cultured host cell or the culture medium; and further optionally,
lyophilising the thus purified protein.

The method may further comprise the step of formulating the purified
heterologous protein with a carrier or diluent and optionally presenting the thus
formulated protein in a unit dosage form, in the manner discussed above. In one
preferred embodiment, the method involves culturing the host cell in non-selective
media, such as rich media.

We have surprisingly also found that different PDI genes have the ability to increase
the expression of heterologous proteins by different amounts under particular
culture conditions. In particular, as discussed in Example 8, we have shown that
the SKQ2n PDI1 gene provides for higher heterologous protein expression than
the S288c PDI1 gene, when the host cells are cultured in minimal media.

The sole difference between the encoded proteins of the SKQ2n PDI1 and S288c
PDI1 genes is that SKQ2n comprises the additional amino acids EADAEAEA at
positions 506-513 (positions as defined with reference to Genbank accession no.
CAA38402, as given above).

The differences between the gene sequences used are shown in the sequence
alignment given in Figure 94 and can be summarised as follows:

• The promoter of SKQ2n includes a run of fourteen "TA" repeats, whereas
the promoter of S288c only has twelve "TA" repeats;

• Ser41 is encoded by TCT in SKQ2n, but by TCC in S288c;

• Glu44 is encoded by GAA in SKQ2n but by GAG in S288c;

• Leu262 is encoded by TTG in SKQ2n but by TTA in S288c;

• Asp514 is encoded by GAC in SKQ2n but the homologous Asp506 is
encoded by GAT in S288c;

• The terminator sequence of SKQ2n contains a run of 8 consecutive "A"
bases, whereas the terminator sequence of S288c contains a run of 7
consecutive "A" bases and does not include an "A" base at the equivalent
of position 1880 in the SKQ2n gene;

• The terminator sequence of SKQ2n has a "C" at position 1919, whereas
the terminator sequence of S288c has a "T" at the equivalent position.
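
The four codon differences listed above are consistent with the statement that the encoded proteins differ only by the EADAEAEA insertion, because each pair of codons encodes the same amino acid under the standard genetic code. The Python sketch below illustrates this check. It is an assumption-labelled illustration only: the table is a small subset of the standard genetic code, and the SKQ2n codons for Glu44 and Asp514 are taken as GAA and GAC, as recited for the thirteenth aspect.

```python
# Subset of the standard genetic code covering only the codons of interest.
CODON_TABLE = {
    "TCT": "Ser", "TCC": "Ser",
    "GAA": "Glu", "GAG": "Glu",
    "TTG": "Leu", "TTA": "Leu",
    "GAC": "Asp", "GAT": "Asp",
}

# (residue label, SKQ2n codon, S288c codon)
DIFFERENCES = [
    ("Ser41", "TCT", "TCC"),
    ("Glu44", "GAA", "GAG"),
    ("Leu262", "TTG", "TTA"),
    ("Asp514/Asp506", "GAC", "GAT"),
]


def is_silent(codon_a: str, codon_b: str) -> bool:
    """Return True if both codons encode the same amino acid."""
    return CODON_TABLE[codon_a] == CODON_TABLE[codon_b]


results = {name: is_silent(a, b) for name, a, b in DIFFERENCES}
for name, silent in results.items():
    print(name, "silent" if silent else "changes the amino acid")
```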

It may be advantageous to include any or all of the above mentioned features of 
the SKQ2n gene in a PDI gene of choice, in order to achieve the observed increase 
in heterologous protein expression when the host cells are cultured in minimal 
media. 

Accordingly, in a thirteenth aspect, there is also provided a nucleotide sequence
encoding a protein disulphide isomerase, for use in increasing the expression of a
heterologous protein in a host cell by expression of the nucleotide sequence within
the host cell, which host cell is cultured in minimal media, wherein the nucleotide
sequence encoding the protein disulphide isomerase is characterised in that it has
at least one of the following characteristics:

• the nucleotide sequence comprises a promoter having the sequence of a
natural PDI promoter or a functional variant thereof and comprises a run of
fourteen "TA" repeats; or

• the encoded protein disulphide isomerase comprises the amino acids
EADAEAEA or a conservatively substituted variant thereof, typically at
positions 506-513 as defined with reference to Genbank accession no.
CAA38402; or

• residue Ser41 of the encoded protein disulphide isomerase is encoded by
the codon TCT; or

• residue Glu44 of the encoded protein disulphide isomerase is encoded by
the codon GAA; or

• residue Leu262 of the encoded protein disulphide isomerase is encoded by
codon TTG; or

• residue Asp514 of the encoded protein disulphide isomerase is encoded by
codon GAC; or

• the nucleotide sequence comprises a terminator sequence having the
sequence of a natural PDI terminator or a functional variant thereof and
either comprises a run of 8 consecutive "A" bases and/or the base "C" at
position 1919 (as defined by reference to position 1919 of the natural
SKQ2n terminator sequence).
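
Two of these characteristics, the run of fourteen "TA" repeats and the run of 8 consecutive "A" bases, are simple repeat counts that can be checked on a candidate sequence. The Python sketch below shows one way to do so. The function name is hypothetical, and the toy sequences are invented for illustration; they are not the SKQ2n or S288c sequences.

```python
import re


def longest_repeat_run(sequence: str, unit: str) -> int:
    """Return the largest number of consecutive copies of `unit` in `sequence`."""
    pattern = re.compile(f"(?:{re.escape(unit.upper())})+")
    runs = [len(m.group(0)) // len(unit) for m in pattern.finditer(sequence.upper())]
    return max(runs, default=0)


# Hypothetical toy sequences (not real promoter or terminator sequences).
toy_promoter = "GGC" + "TA" * 14 + "CGG"
toy_terminator = "CCG" + "A" * 8 + "TGC"

print(longest_repeat_run(toy_promoter, "TA"))
print(longest_repeat_run(toy_terminator, "A"))
```

A sequence would satisfy the "TA" characteristic when the promoter count is fourteen, and the "A" characteristic when the terminator count is eight.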

The present invention also provides, in a fourteenth aspect, a method for
producing a heterologous protein comprising the steps of: providing a host cell
comprising a recombinant gene that encodes a protein disulphide isomerase and
having the sequence of the above-defined nucleic acid sequence, the host cell
further comprising a recombinant gene encoding a heterologous protein; culturing
the host cell in a minimal culture medium under conditions that allow the
expression of the protein disulphide isomerase and the heterologous protein; and
optionally purifying the thus expressed heterologous protein from the cultured
host cell or the culture medium; and further optionally, lyophilising the thus
purified protein; and optionally further formulating the purified heterologous
protein with a carrier or diluent; and optionally presenting the thus formulated
protein in a unit dosage form, in the manner discussed above.

The genes encoding the PDI and heterologous protein can be provided in the 
manner described above in respect of other embodiments of the present invention. 

We have also found that the effects of recombinantly-provided chaperones
according to the other embodiments of the present invention can be modulated by
modifying the promoters that control the expression levels of the chaperone(s).
Surprisingly we have found that, in some cases, shorter promoters result in
increased heterologous protein expression. Without being bound by theory we
believe that this is because the expression of a recombinant chaperone in host cells
that already express heterologous proteins at high levels can cause the cells to
overload themselves with heterologously expressed protein, thereby achieving
little or no overall increase in heterologous protein production. In those cases, it
may be beneficial to provide recombinant chaperone genes with truncated
promoters.

Accordingly, in a fifteenth aspect of the present invention there is provided a
polynucleotide (such as a plasmid as defined above) comprising the sequence of a
promoter operably connected to a coding sequence encoding a chaperone (such as
those described above), for use in increasing the expression of a heterologous
protein (such as those described above) in a host cell (such as those described
above) by expression of the polynucleotide sequence within the host cell, wherein
the promoter is characterised in that it achieves a modified, such as a higher or
lower, level of expression of the chaperone than would be achieved if the coding
sequence were to be operably connected to its naturally occurring promoter.

The present invention also provides, in a sixteenth aspect, a method for producing
a heterologous protein comprising the steps of: providing a host cell comprising a
recombinant gene that comprises the sequence of a promoter operably connected to
a coding sequence encoding a chaperone, the promoter being characterised in that
it achieves a lower level of expression of the chaperone than would be achieved if
the coding sequence were to be operably connected to its naturally occurring
promoter, and the host cell further comprising a recombinant gene encoding a
heterologous protein; culturing the host cell under conditions that allow the
expression of the chaperone and the heterologous protein; and optionally purifying
the thus expressed heterologous protein from the cultured host cell or the culture
medium; and further optionally, lyophilising the thus purified protein; and
optionally further formulating the purified heterologous protein with a carrier or
diluent; and optionally presenting the thus formulated protein in a unit dosage
form, in the manner discussed above.

As is apparent from the examples of the present application, the combination of
recombinantly expressed PDI and transferrin-based proteins provides a
surprisingly high level of transferrin expression. For example, transferrin
expression in a system that includes a chromosomally encoded recombinant PDI
gene provided a 2-fold increase (compared to a control in which there is no
chromosomally encoded recombinant PDI gene). This increase was 5-times
greater than an equivalent system comprising a recombinant gene encoding human
albumin in place of the recombinant transferrin gene.

The host may be any cell type, such as a prokaryotic cell (e.g. bacterial cells such
as E. coli) or a eukaryotic cell. Preferred eukaryotic cells include fungal cells,
such as yeast cells, and mammalian cells. Exemplary yeast cells are discussed
above. Exemplary mammalian cells include human cells.

Host cells as described above can be cultured to produce recombinant
transferrin-based proteins. The thus produced transferrin-based proteins can be isolated from
the culture and purified, preferably to a pharmaceutically acceptable level of
purity, for example using techniques known in the art and/or as set out above.
Purified transferrin-based proteins may be formulated with a pharmaceutically
acceptable carrier or diluent and may be presented in unit dosage form.

The present invention will now be exemplified with reference to the following 
non-limiting examples and figures. 

BRIEF DESCRIPTION OF THE FIGURES 

Figures 1, 2, 4, 6 to 15, 22, 25, 27 to 52, 57 to 71, 74, 75, 77 to 79, 81 to 83, 85
to 91, 95 and 96 show various plasmid maps.

Figure 3 shows plasmid insertion sites. 

Figure 5 shows a restriction map of a DNA fragment containing the PDI coding
sequence.

Figure 16 shows the results of rocket immunoelectrophoresis (RIE) determination
of increased recombinant transferrin (N413Q, N611Q) secretion with PDI1
over-expression. Cryopreserved yeast stocks were grown for 4-days in 10mL BMMD
shake flask cultures and supernatants were loaded at 5µL per well. Goat
polyclonal anti-transferrin (human) antiserum (Calbiochem) was used at 40µL per
rocket immunoelectrophoresis gel (50mL). A = Control strain [pSAC35],
duplicate flasks; B = Control strain [pDB2536], duplicate flasks; C = Control
strain [pDB2711], neat to 40-fold aqueous dilutions; D = Control strain
[pDB2931], duplicate flasks; E = Control strain [pDB2929], neat to 40-fold
aqueous dilutions.

Figure 17 shows the results of RIE analysis of recombinant transferrin (N413Q,
N611Q) secretion with and without PDI1 over-expression. Cryopreserved yeast
stocks were grown for 4-days in 10mL BMMD shake flask cultures and
supernatants were loaded at 5µL per well. Duplicate loadings were made of
supernatants from two individual cultures of each strain. Goat polyclonal
anti-transferrin (human) antiserum (Calbiochem) was used at 40µL per rocket
immunoelectrophoresis gel (50mL). A = Control strain [pSAC35]; B = Control
strain [pDB2536]; C = Control strain [pDB2711]; D = Control strain [pDB2931];
E = Control strain [pDB2929].

Figure 18 shows the results of SDS-PAGE analysis of recombinant transferrin
secretion with and without PDI1 over-expression. BMMD shake flask cultures
were grown for 4-days and 10µL supernatant analysed on non-reducing
SDS-PAGE (4-12% NuPAGE®, MOPS buffer, InVitrogen) with GelCode® Blue
reagent (Pierce). SeeBlue Plus2 Markers (InVitrogen). 1 = pDB2536; 2 =
pDB2536; 3 = pDB2711; 4 = pDB2711; 5 = pDB2931; 6 = pDB2931; 7 =
pDB2929; 8 = pDB2929; 9 = pSAC35 control.

Figure 19 shows RIE analysis of recombinant transferrin secretion from S.
cerevisiae strains with an additional integrated copy of PDI1. 5-day BMMD shake
flask culture supernatants were loaded at 5µL per well. Strains contained: 1)
pSAC35 (negative control); 2) pDB2536 (recombinant non-glycosylated
transferrin (N413Q, N611Q)) or 3) pDB2506 (same as plasmid pDB2536 but the
transferrin ORF encodes transferrin without the N→Q mutations at positions 413
and 611, i.e. recombinant glycosylated transferrin). Each well contained a sample
derived from an individual transformant. Standards were human plasma
holo-transferrin (Calbiochem) at 100, 50, 20, 10, 5 and 2mg.L⁻¹.

Figure 20 shows RIE analysis of recombinant transferrin secretion from Strain A
[pDB2536] and Strain A [pDB2506] grown in shake flask culture. 5-day BMMD
or YEPD shake flask culture supernatants were loaded in duplicate at 5µL per
well.

Figure 21 shows SDS-PAGE analysis of recombinant transferrin secreted from
Strain A [pDB2536] and Strain A [pDB2506] grown in shake flask culture.
Cultures were grown for 5-days in BMMD and 30µL supernatants analysed on
SDS-PAGE (4-12% NuPAGE™, MOPS Buffer, InVitrogen) stained with
GelCode® Blue Reagent (Pierce). 1) Strain A [pDB2536] transformant 1; 2) Strain
A [pDB2536] transformant 2; 3) Strain A [pSAC35] control; 4) Strain A
[pDB2506] transformant 1; 5) SeeBlue Plus2 Protein Standards (approximate
molecular weights only).

Figure 23 shows RIE of recombinant transferrin secreted from S. cerevisiae
strains with different PDI1 copy numbers. 3-day BMMD shake flask culture
supernatants were loaded at 5µL per well. Goat polyclonal anti-transferrin
(human) antiserum (Calbiochem) was used at 30µL per rocket
immunoelectrophoresis gel (50mL). (A) supernatant from S. cerevisiae control
strain [pDB2711] or [pDB2712]; (B) supernatant from Strain A [pDB2536]; (C)
supernatant from control strain [pDB2536].

Figure 24 shows SDS-PAGE analysis of recombinant transferrin secreted from S.
cerevisiae strains with different PDI1 copy numbers. 4-12% NuPAGE reducing
gel run with MOPS buffer (InVitrogen) after loading with 30µL of 3-day BMMD
shake flask culture supernatant per lane; (lane 1) supernatant from control strain
[pDB2536]; (lane 2) supernatant from Strain A [pDB2536]; (lanes 3-6)
supernatant from control strain [pDB2711] or [pDB2712]; (lane 7) molecular
weight markers (SeeBlue Plus2, InVitrogen).

Figure 26 shows RIE of recombinant transferrin secreted from different S.
cerevisiae strains with and without additional PDI1 gene co-expression. 10mL
YEPD shake flasks were inoculated with yeast and incubated for 4-days at 30°C.
5µL culture supernatant loaded per well of a rocket immunoelectrophoresis gel.
Plasma Tf standards concentrations are in µg/mL. 20µL goat anti-Tf / 50mL
agarose. Precipitin was stained with Coomassie blue.

Figure 53 shows RIE analysis of rHA expression in different S. cerevisiae strains
when co-expressed with PDI1 genes having different length promoters. 10mL
YEPD shake flasks were inoculated with yeast and incubated for 4-days at 30°C.
4µL culture supernatant loaded per well of a rocket immunoelectrophoresis gel.
rHA standards concentrations are in µg/mL. 400µL goat anti-HA (Sigma product
A-1151 resuspended in 5mL water) /50mL agarose. Precipitin was stained with
Coomassie blue.

Figure 54 shows RIE analysis of rHA expression in different S. cerevisiae strains
when co-expressed with PDI1 genes having different length promoters. 10mL
YEPD shake flasks were inoculated with yeast and incubated for 4-days at 30°C.
4µL culture supernatant loaded per well of a rocket immunoelectrophoresis gel.
rHA standards concentrations are in µg/mL. 400µL goat anti-HA (Sigma product
A-1151 resuspended in 5mL water) /50mL agarose. Precipitin was stained with
Coomassie blue.

Figure 55 shows RIE analysis of rTF expression, when co-expressed with
different PDI1 constructs. 10mL BMMD shake flasks were inoculated with yeast
and incubated for 4-days at 30°C. 5µL culture supernatant was loaded per well of
a rocket immunoelectrophoresis gel containing 25µL goat anti-Tf / 50mL. Plasma
Tf standards concentrations are in µg/mL. Precipitin was stained with Coomassie
blue.

Figure 56 shows RIE analysis of rTF expression, when co-expressed with
different PDI1 constructs. 10mL YEPD shake flasks were inoculated with yeast
and incubated for 4-days at 30°C. 5µL culture supernatant was loaded per well of
a rocket immunoelectrophoresis gel containing 25µL goat anti-Tf / 50mL. Plasma
Tf standards concentrations are in µg/mL. Precipitin was stained with Coomassie
blue.

Figure 72 shows RIE analysis of rHA fusion proteins with and without
co-expressed recombinant PDI1. 10mL BMMD shake flasks were inoculated with
YBX7 transformed with albumin fusion expression plasmids and incubated for
4-days at 30°C. 4µL culture supernatant loaded per well of a rocket
immunoelectrophoresis gel. rHA standards concentrations are in µg/mL. 200µL
goat anti-HA (Sigma product A-1151 resuspended in 5mL water) /50mL agarose.
Precipitin was stained with Coomassie blue.

Figure 73 shows SDS-PAGE analysis of recombinant albumin fusion secretion
with and without PDI1 present on the expression plasmid. 10mL BMMD shake
flasks were inoculated with yeast and incubated for 4-days at 30°C, 200rpm. 30µL
supernatant analysed on non-reducing SDS-PAGE (4-12% NuPAGE®, MES
buffer, InVitrogen) with GelCode® Blue reagent (Pierce). 1 = SeeBlue Plus2
Markers (InVitrogen); 2 = 1µg rHA; 3 = angiostatin-rHA; 4 = angiostatin-rHA +
PDI1; 5 = endostatin-rHA; 6 = endostatin-rHA + PDI1; 7 = DX-890-(GGS)4GG-
rHA; 8 = DX-890-(GGS)4GG-rHA + PDI1; 9 = DPI-14-(GGS)4GG-rHA; 10 =
DPI-14-(GGS)4GG-rHA + PDI1; 11 = Axokine™ (CNTFAx15)-(GGS)4GG-rHA
(Lambert et al., 2001, Proc. Natl. Acad. Sci. USA, 98, 4652-4657); 12 =
Axokine™ (CNTFAx15)-(GGS)4GG-rHA + PDI1.

Figure 76 shows RIE analysis demonstrating increased transferrin secretion from
S. cerevisiae with ORM2 co-expression from a 2µm-based plasmid. Four day
shake flask culture supernatants were loaded at 5µl per well. Standards were
human plasma holo-transferrin (Calbiochem), at 25, 20, 15, 10, 5 µg/ml, loaded
5µl per well. Goat polyclonal anti-transferrin (human) antiserum (Calbiochem)
used at 20µl per rocket immunoelectrophoresis gel (50 ml).

Figure 80 shows RIE analysis demonstrating increased transferrin secretion from
S. cerevisiae with PSE1 co-expression from a 2µm-based plasmid. Four day
shake flask culture supernatants were loaded at 5µl per well. Standards were
human plasma holo-transferrin (Calbiochem), at 25, 20, 15, 10, 5 µg/ml, loaded
5µl per well. Goat polyclonal anti-transferrin (human) antiserum (Calbiochem)
used at 20µl per rocket immunoelectrophoresis gel (50 ml).

Figure 84 shows RIE analysis demonstrating increased transferrin secretion from
S. cerevisiae with SSA1 co-expression from a 2µm-based plasmid. Four day shake
flask culture supernatants were loaded at 5µl per well. Standards were human
plasma holo-transferrin (Calbiochem), at 25, 20, 15, 10, 5 µg/ml, loaded 5µl per
well. Goat polyclonal anti-transferrin (human) antiserum (Calbiochem) used at
20µl per rocket immunoelectrophoresis gel (50 ml).

Figure 92 shows the results of RIE. 10mL YEPD shake flasks were inoculated
with DXY1 trp1Δ [pDB2976], DXY1 trp1Δ [pDB2977], DXY1 trp1Δ
[pDB2978], DXY1 trp1Δ [pDB2979], DXY1 trp1Δ [pDB2980] or DXY1 trp1Δ
[pDB2981] transformed to tryptophan prototrophy with a 1.41kb NotI/PstI
pdi1::TRP1 disrupting DNA fragment isolated from pDB3078.
Transformants were grown for 4-days at 30°C, 200rpm. 4µL culture supernatant
loaded per well of a rocket immunoelectrophoresis gel. rHA standards
concentrations are in µg/mL. 700µL goat anti-HA (Sigma product A-1151
resuspended in 5mL water) /50mL agarose. Precipitin was stained with Coomassie
blue. Isolates selected for further analysis are indicated (*).

Figure 93 shows the results of RIE. 10mL YEPD shake flasks were inoculated
with DXY1 [pDB2244], DXY1 [pDB2976], DXY1 trp1Δ pdi1::TRP1
[pDB2976], DXY1 [pDB2978], DXY1 trp1Δ pdi1::TRP1 [pDB2978], DXY1
[pDB2980], DXY1 trp1Δ pdi1::TRP1 [pDB2980], DXY1 [pDB2977], DXY1
trp1Δ pdi1::TRP1 [pDB2977], DXY1 [pDB2979], DXY1 trp1Δ pdi1::TRP1
[pDB2979], DXY1 [pDB2981] and DXY1 trp1Δ pdi1::TRP1 [pDB2981], and
were grown for 4-days at 30°C, 200rpm. 4µL culture supernatant loaded per well
of a rocket immunoelectrophoresis gel. rHA standards concentrations are in
µg/mL. 800µL goat anti-HA (Sigma product A-1151 resuspended in 5mL water)
/50mL agarose. Precipitin was stained with Coomassie blue. Isolates selected for
further analysis are indicated.

Figure 94 shows a sequence alignment of the SKQ2n and S288c gene sequences 
with long promoters, as described in Example 6. 


EXAMPLES 

Two types of expression cassette have been used to exemplify secretion of a
recombinant human transferrin mutant (N413Q, N611Q) from S. cerevisiae. One
type uses a modified HSA(pre)/MFα1(pro) leader sequence (named the "modified
fusion leader" sequence). The second type of expression cassette uses only the
modified HSA(pre) leader sequence.

The 24 amino acid sequence of the "modified fusion leader" is
MKWVFIVSILFLFSSAYSRSLDKR.

The 18 amino acid sequence of the modified HSA(pre) leader sequence is 
MKWVFIVSILFLFSSAYS. 
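
The relationship between the two leaders can be checked directly from the
sequences quoted above. The following is a minimal sketch only; the function
and variable names are illustrative and do not come from the specification.

```python
# Leader sequences exactly as quoted in the text above.
MODIFIED_FUSION_LEADER = "MKWVFIVSILFLFSSAYSRSLDKR"
MODIFIED_HSA_PRE_LEADER = "MKWVFIVSILFLFSSAYS"


def extra_c_terminal_residues(longer: str, shorter: str) -> str:
    """Return the residues that `longer` carries after the shared prefix `shorter`."""
    if not longer.startswith(shorter):
        raise ValueError("the shorter leader is not a prefix of the longer one")
    return longer[len(shorter):]


# Residues present only in the fusion leader (hypothetical variable name).
pro_region = extra_c_terminal_residues(MODIFIED_FUSION_LEADER, MODIFIED_HSA_PRE_LEADER)
print(len(MODIFIED_FUSION_LEADER), len(MODIFIED_HSA_PRE_LEADER), pro_region)
```

Under these inputs the two leaders differ only by the six C-terminal residues
of the longer sequence.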

Transferrin (N413Q, N611Q) expression using these two cassettes has been
studied in S. cerevisiae using the 2µm expression vector with and without an
additional copy of the S. cerevisiae PDI gene, PDI1.

EXAMPLE 1

Construction of expression plasmids

A 52-bp linker made by annealing 0.5mM solutions of oligonucleotides CF86 and
CF87 (see below) was introduced into the US-region of the 2µm plasmid pSAC35
at the XcmI-sites in the 599-bp inverted repeats. One XcmI-site cuts 51-bp after
the REP2 translation termination codon, whereas the other XcmI-site cuts 127-bp
before the end of the FLP coding sequence, due to overlap with the inverted repeat
(see Figure 3). This DNA linker contained a core region "SnaBI-PacI-FseI/SfiI-
SmaI-SnaBI", which encoded restriction sites absent from pSAC35.

XcmI Linker (CF86+CF87)

                                   SfiI
                     PacI                       SnaBI
              SnaBI            FseI       SmaI

CF86   GGAGTGGTA CGTATTAATT AAGGCCGGCC AGGCCCGGGT ACGTACCAAT TGA
CF87  TCCTCACCAT GCATAATTAA TTCCGGCCGG TCCGGGCCCA TGCATGGTTA AC
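
The linker listing can be checked computationally. The sketch below is
illustrative only and rests on one assumption that the listing does not state
explicitly: CF87 is taken to be printed 3' to 5', so that it lines up beneath
CF86 with its first base one column to the left. The recognition sequences
used are the standard ones for the named enzymes; the function names are
hypothetical.

```python
import re

CF86 = "GGAGTGGTACGTATTAATTAAGGCCGGCCAGGCCCGGGTACGTACCAATTGA"  # read 5'->3'
# Assumption: printed 3'->5', offset one base to the left of CF86.
CF87_AS_PRINTED = "TCCTCACCATGCATAATTAATTCCGGCCGGTCCGGGCCCATGCATGGTTAAC"

# Standard recognition sequences, written as regular expressions.
RECOGNITION_PATTERNS = {
    "SnaBI": "TACGTA",
    "PacI": "TTAATTAA",
    "FseI": "GGCCGGCC",
    "SfiI": "GGCC[ACGT]{5}GGCC",
    "SmaI": "CCCGGG",
}


def complement(seq: str) -> str:
    """Base-by-base complement, without reversing the strand."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))


def strands_pair_with_single_base_overhangs(top: str, bottom_as_printed: str) -> bool:
    """True if the strands pair everywhere except one unpaired base at each end."""
    return complement(top[:-1]) == bottom_as_printed[1:]


def find_sites(seq: str) -> dict:
    """Map each enzyme to the 0-based start positions of its site in `seq`."""
    return {
        name: [m.start() for m in re.finditer(f"(?={pattern})", seq)]
        for name, pattern in RECOGNITION_PATTERNS.items()
    }


print(strands_pair_with_single_base_overhangs(CF86, CF87_AS_PRINTED))
print(find_sites(CF86))
```

On the sequences as listed, both oligonucleotides are 52 bases long and the
sites occur in the order given for the core region, with SnaBI flanking both
ends.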

Plasmid pSAC35 was partially digested with XcmI, the linear 11-kb fragment was
isolated from a 0.7%(w/v) agarose gel, ligated with the CF86/CF87 XcmI linker
(neat, 10"^ and 10"^ dilutions) and transformed into E. coli DH5α. Ampicillin
resistant transformants were selected and screened for the presence of plasmids
that could be linearised by SmaI digestion. Restriction enzyme analysis identified
pDB2688 (Figure 4) with the linker cloned into the XcmI-site after REP2. DNA
sequencing using oligonucleotide primers CF88, CF98 and CF99 (Table 1)
confirmed the insertion contained the correct linker sequence.

Table 1: Oligonucleotide sequencing primers

Primer   Description          Sequence
CF88     REP2 primer, 20mer   5'-ATCACGTAATACTTCTAGGG-3'
CF98     REP2 primer, 20mer   5'-AGAGTGAGTTGGAAGGAAGG-3'
CF99     REP2 primer, 20mer   5'-AGCTCGTAAGCGTCGTTACC-3'

The yeast strain was transformed to leucine prototrophy using a modified lithium
acetate method (Sigma yeast transformation kit, YEAST-1, protocol 2; (Ito et al.,
1983, J. Bacteriol., 153, 163; Elble, 1992, Biotechniques, 13, 18)). Transformants
were selected on BMMD-agar plates, and were subsequently patched out on
BMMD-agar plates. Cryopreserved trehalose stocks were prepared from 10mL
BMMD shake flask cultures (24 hrs, 30°C, 200rpm), by addition of an equal
volume of sterile 40% (w/v) trehalose.

The composition of YEPD and BMMD is described by Sleep et al., 2002, Yeast,
18, 403. YEPS and BMMS are similar in composition to YEPD and BMMD
except that 2% (w/v) sucrose was substituted for the 2% (w/v) glucose as the sole
initial carbon source.

The S. cerevisiae PDI1 gene was cloned into the XcmI-linker of pDB2688. The
PDI1 gene (Figure 5) was cloned on a 1.9-kb SacI-SpeI fragment from a larger S.
cerevisiae genomic SKQ2n DNA fragment containing the PDI1 gene (as provided
in the plasmid pMA3a:C7 that is described in US 6,291,205 and also described as
Clone C7 in Crouzet & Tuite, 1987, Mol. Gen. Genet., 210, 581-583 and Farquhar
et al., 1991, supra), which had been cloned into YIplac211 (Gietz & Sugino, 1988,
Gene, 74, 527-534), and had a synthetic DNA linker containing a SacI restriction
site inserted at a unique Bsu36I-site in the 3' untranslated region of the PDI1 gene.
The 1.9-kb SacI-SpeI fragment was treated with T4 DNA polymerase to fill the
SpeI 5'-overhang and remove the SacI 3'-overhang. This PDI1 fragment included
212-bp of the PDI1 promoter upstream of the translation initiation codon, and
148-bp downstream of the translation termination codon. This was ligated with
SmaI linearised/calf intestinal alkaline phosphatase treated pDB2688, to create
plasmid pDB2690 (Figure 6), with the PDI1 gene transcribed in the same
direction as REP2. A S. cerevisiae strain was transformed to leucine prototrophy
with pDB2690.
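
The stated size of the PDI1 fragment can be cross-checked by simple
arithmetic. The sketch below assumes that the open reading frame length can be
taken from the S288c chromosome coordinates quoted in Example 6 (50221 to
48653, treated as inclusive), and that the SKQ2n open reading frame has the
same length; neither assumption is stated in the text. The variable names are
illustrative.

```python
# Flanking lengths stated for the SacI-SpeI PDI1 fragment.
PROMOTER_BP = 212      # upstream of the translation initiation codon
DOWNSTREAM_BP = 148    # downstream of the translation termination codon

# Assumption: ORF length taken from the S288c coordinates quoted in Example 6.
ORF_FIRST_COORD = 50221
ORF_LAST_COORD = 48653

# Inclusive coordinates, so add one to the difference.
orf_bp = abs(ORF_FIRST_COORD - ORF_LAST_COORD) + 1
fragment_bp = PROMOTER_BP + orf_bp + DOWNSTREAM_BP

print(f"ORF: {orf_bp} bp, fragment: {fragment_bp} bp (~{fragment_bp / 1000:.1f} kb)")
```

Under these assumptions the total comes to about 1.9 kb, in line with the
fragment size given above.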

An expression cassette for a human transferrin mutant (N413Q, N611Q) was
subsequently cloned into the NotI-site of pDB2690 to create pDB2711 (Figure 7).
The expression cassette in pDB2711 contains the S. cerevisiae PRB1 promoter, an
HSA/MFα fusion leader sequence (EP 387319; Sleep et al., 1990, Biotechnology
(N.Y.), 8, 42) followed by a coding sequence for the human transferrin mutant
(N413Q, N611Q) and the S. cerevisiae ADH1 terminator. Plasmid pDB2536 was
constructed similarly by insertion of the same expression cassette into the
NotI-site of pSAC35.

The "modified fusion leader" sequence used in pDB2536 and pDB2711 comprises
a modified HSA-pre sequence and a MFα1-pro sequence. An alternative leader
sequence used was the modified HSA-pre sequence, which was derived from the
modified fusion leader sequence by removal of the six residues of the MFα1-pro
sequence.

The modified fusion leader sequence in pDB2515 (Figure 8) was mutated with
oligonucleotides CF154 and CF155 to delete the coding sequence for the six
residues (RSLDKR) of the MFα1-pro region. This was performed according to
the instruction manual of Stratagene's QuickChange™ Site-Directed
Mutagenesis Kit. pDB2515 is the E. coli cloning vector pGEM-7Z(-) (Promega)
containing the 2940-bp NotI-HindIII (partial) DNA fragment of pDB2529 (see
below) ligated between the PspOMI and HindIII sites.

CF154
5'-GTTCTTGTTCTCCTCTGCTTACTCTGTCCCTGATAAAACTGTGAGATGG-3'

CF155
5'-CCATCTCACAGTTTTATCAGGGACAGAGTAAGCAGAGGAGAACAAGAAC-3'
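
The two mutagenic oligonucleotides can be examined with a short script. This
is a sketch under stated assumptions: the reading frame (starting at the
second base of CF154) is not given in the text and is chosen here because it
reproduces the final residues of the modified HSA-pre leader; the codon table
is the standard genetic code; the function names are illustrative.

```python
CF154 = "GTTCTTGTTCTCCTCTGCTTACTCTGTCCCTGATAAAACTGTGAGATGG"
CF155 = "CCATCTCACAGTTTTATCAGGGACAGAGTAAGCAGAGGAGAACAAGAAC"

# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def reverse_complement(seq: str) -> str:
    """Complement each base, then reverse the strand."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate(seq: str, frame: int) -> str:
    """Translate complete codons of `seq`, starting at 0-based offset `frame`."""
    codons = [seq[i:i + 3] for i in range(frame, len(seq) - 2, 3)]
    return "".join(CODON_TABLE[codon] for codon in codons)


# Assumption: frame 1 is the frame continuous with the leader coding sequence.
junction = translate(CF154, 1)
print(reverse_complement(CF154) == CF155, junction)
```

Read in this frame, CF154 encodes the last residues of the modified HSA-pre
leader (FLFSSAYS) followed directly by downstream residues, with no RSLDKR
between them, which is what the deletion described above would be expected to
give.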

Competent E. coli DH5α cells were transformed with the mutated plasmids and
ampicillin resistant colonies were selected. Plasmid DNA from these colonies
was screened by double digestion with EcoRI and BglII. The correct DNA
sequence for the modified HSA-pre leader was subsequently confirmed in
pDB2921 (Figure 9) over a 386-bp region between the AflII and BamHI sites
either side of the leader sequence. This 386-bp AflII-BamHI fragment was
isolated, and ligated with a 6,081-bp AflII-BamHI fragment from pDB2529
(Figure 10), prepared by partial digestion with BamHI and complete digestion
with AflII and calf intestinal alkaline phosphatase. pDB2529 is the E. coli cloning
vector pBST(+) (Sleep et al., 2001, Yeast, 18, 403-441) containing the transferrin
expression cassette of pDB2536 cloned into the unique NotI-site. This produced
pDB2928 (Figure 11), which was isolated from ampicillin resistant E. coli DH5α
cells transformed with the ligation products.

The 3,256-bp NotI expression cassette was isolated from pDB2928. This
contained the PRB1 promoter, the coding region for the modified HSA-pre leader
sequence followed by transferrin (N413Q, N611Q), and the ADH1 terminator.
This was ligated into the NotI sites of the 2µm-based vectors pSAC35 and
pDB2690 to generate the expression plasmids pDB2929, pDB2930, pDB2931 and
pDB2932 (Figures 12-15). In pDB2929 and pDB2931 the transferrin (N413Q,
N611Q) sequence is transcribed in the same direction as LEU2, whereas in
pDB2930 and pDB2932 transcription is in the opposite direction.


EXAMPLE 2

Expression of transferrin

An S. cerevisiae control strain was transformed to leucine prototrophy with all the
transferrin (N413Q, N611Q) expression plasmids, and cryopreserved stocks were
prepared.

Strains were grown for four days at 30°C in 10mL BMMD cultures in 50mL
conical flasks shaken at 200rpm. The titres of recombinant transferrin secreted
into the culture supernatants were compared by rocket immunoelectrophoresis
(RIE as described in Weeke, B., 1976, "Rocket immunoelectrophoresis" In N. H.
Axelsen, J. Kroll, and B. Weeke [eds.], A manual of quantitative
immunoelectrophoresis. Methods and applications. Universitetsforlaget, Oslo,
Norway), reverse phase high performance liquid chromatography (RP-HPLC)
(Table 2), and non-reducing SDS polyacrylamide electrophoresis stained with
colloidal Coomassie blue stain (SDS-PAGE). The increase in recombinant
transferrin secreted when S. cerevisiae PDI1 was over-expressed was estimated to
be greater than 10-fold.

Table 2:




RIE analysis indicated that the increased transferrin secretion in the presence of
additional copies of PDI1 was approximately 15-fold (Figure 16). By RIE
analysis the increase appeared slightly larger for the modified HSA-pre leader
sequence than for the modified fusion leader sequence (Figure 17).

By RP-HPLC analysis the increase in transferrin secretion was determined to be
18-fold for the modified fusion leader sequence and 15-fold for the modified
HSA-pre leader sequence (Table 2).

Figure 18 shows an SDS-PAGE comparison of the recombinant transferrin
secreted by S. cerevisiae strains with and without additional PDI1 expression.
RP-HPLC Method for Determining Transferrin Expression
Column: 50 x 4.6mm Phenomenex Jupiter C4 300Å, 5µm
Column temperature: 45°C
Flow rate: 1mL.min⁻¹
Peak detection: UV absorbance at 214nm
HPLC mobile phase A: 0.1% TFA, 5% Acetonitrile
HPLC mobile phase B: 0.1% TFA, 95% Acetonitrile
Gradient:  0 to 3 minutes     30% B
           3 to 13 minutes    30 to 55% B in a linear gradient
           13 to 14 minutes   55% B
           14 to 15 minutes   55 to 30% B in a linear gradient
           15 to 20 minutes   30% B
Injection: Generally 100µL of sample, but any volume can be injected
Standard curve: 0.1 to 10µg of human transferrin injected vs peak area
Standard curve used for the results shown was linear up to 10µg.

y = 530.888x + 10526.7
where y = peak area, and x = amount in µg.

(r²): 0.999953, where correlation coefficient = r
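
The gradient programme and the standard curve above lend themselves to a
short worked example. The sketch below is illustrative only: it interpolates
the percentage of mobile phase B at a given time from the gradient table, and
inverts the linear standard curve to estimate the amount of transferrin from a
peak area. The slope and intercept are used exactly as printed above; the
function names are hypothetical and form no part of the method.

```python
# Gradient segments: (start minute, end minute, %B at start, %B at end).
GRADIENT = [
    (0, 3, 30, 30),
    (3, 13, 30, 55),
    (13, 14, 55, 55),
    (14, 15, 55, 30),
    (15, 20, 30, 30),
]

# Standard curve coefficients as printed: peak area = SLOPE * micrograms + INTERCEPT.
SLOPE = 530.888
INTERCEPT = 10526.7


def percent_b(minute: float) -> float:
    """Linearly interpolate %B at `minute` within the 20 minute programme."""
    for start, end, b_start, b_end in GRADIENT:
        if start <= minute <= end:
            return b_start + (b_end - b_start) * (minute - start) / (end - start)
    raise ValueError("time outside the 0 to 20 minute programme")


def peak_area_for(micrograms: float) -> float:
    """Peak area predicted by the standard curve for a given amount."""
    return SLOPE * micrograms + INTERCEPT


def transferrin_ug(peak_area: float) -> float:
    """Invert the standard curve; valid only within the 0.1 to 10 microgram range."""
    amount = (peak_area - INTERCEPT) / SLOPE
    if not 0.1 <= amount <= 10:
        raise ValueError("peak area falls outside the calibrated range")
    return amount


print(percent_b(8), transferrin_ug(peak_area_for(5.0)))
```

A peak area that maps outside the 0.1 to 10µg calibration range is rejected
rather than extrapolated, since linearity is only stated up to 10µg.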

EXAMPLE 3

Chromosomal over-expression of PDI

S. cerevisiae Strain A was selected to investigate the secretion of recombinant
glycosylated transferrin expression from plasmid pDB2506 and recombinant non-
glycosylated transferrin (N413Q, N611Q) from plasmid pDB2536. Strain A has
the following characteristics:

• additional chromosomally integrated PDI1 gene integrated at the host
  PDI1 chromosomal location.

• the URA3 gene and bacterial DNA sequences containing the ampicillin
  resistance gene were also integrated into the S. cerevisiae genome at the
  insertion sites for the above genes.

A control strain had none of the above insertions.

Control strain [cir⁰] and Strain A [cir⁰] were transformed to leucine prototrophy
with pDB2506 (recombinant transferrin), pDB2536 (recombinant non-
glycosylated transferrin (N413Q, N611Q)) or pSAC35 (control). Transformants
were selected on BMMD-agar.

The relative level of transferrin secretion in BMMD shake flask culture was
determined for each strain/plasmid combination by rocket immunoelectrophoresis
(RIE). Figure 19 shows that both strains secreted both the glycosylated and non-
glycosylated recombinant transferrins into the culture supernatant.

The levels of both the glycosylated and non-glycosylated transferrins secreted
from Strain A [pDB2506] and Strain A [pDB2536] respectively, appeared higher
than the levels secreted from the control strain. Hence, at least in shake flask
culture, PDI1 integrated into the host genome at the PDI1 locus in Strain A has
enhanced transferrin secretion.

Furthermore, the increase in transferrin secretion observed between control strain
[pDB2536] and Strain A [pDB2536] appeared to be at least a 100% increase by
RIE. In contrast, the increase in rHA monomer secretion between control strain
[pDB2305] and Strain A [pDB2305] was approximately 20% (data not shown).
Therefore, the increase in transferrin secretion due to the additional copy of PDI1
in Strain A was surprisingly large considering that transferrin has 19 disulphide
bonds, compared to rHA with 17 disulphide bonds. Additional copies of the PDI1
gene may be particularly beneficial for the secretion from S. cerevisiae of proteins
from the transferrin family, and their derivatives.

The levels of transferrin secreted from Strain A [pDB2536] and Strain A
[pDB2506] were compared by RIE for transformants grown in BMMD and YEPD
(Figure 20). Results indicated that a greater than 2-fold increase in titres of both
non-glycosylated recombinant transferrin (N413Q, N611Q) and glycosylated
recombinant transferrin was achieved by growth in YEPD (10-20 mg.L⁻¹ serum
transferrin equivalent) compared to BMMD (2-5 mg.L⁻¹ serum transferrin
equivalent). The increase in both glycosylated and non-glycosylated transferrin
titre observed in YEPD suggested that both transferrin expression plasmids were
sufficiently stable under non-selective growth conditions to allow the expected
increased biomass which usually results from growth in YEPD to be translated
into increased glycosylated and non-glycosylated transferrin productivity.
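
The fold increase implied by the two titre ranges can be bounded with a small
calculation. This is an illustrative sketch only; it treats the quoted ranges
as independent lower and upper limits, which is an assumption, and the
function name is hypothetical.

```python
def fold_change_bounds(test_range, reference_range):
    """Return (smallest, largest) fold change consistent with two titre ranges."""
    test_low, test_high = test_range
    reference_low, reference_high = reference_range
    return test_low / reference_high, test_high / reference_low


# Titre ranges in mg/L serum transferrin equivalent, as quoted above.
YEPD_RANGE = (10, 20)
BMMD_RANGE = (2, 5)

smallest, largest = fold_change_bounds(YEPD_RANGE, BMMD_RANGE)
print(f"fold increase between {smallest:.1f} and {largest:.1f}")
```

On these figures the smallest increase consistent with the ranges is 2-fold,
and the largest is 10-fold.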

SDS-PAGE analysis of non-glycosylated transferrin (N413Q, N611Q) secreted
from Strain A [pDB2536] and glycosylated transferrin from Strain A [pDB2506]
grown in BMMD shake flask culture is shown in Figure 21. Strain A [pDB2536]
samples clearly showed an additional protein band compared to the Strain A
[pSAC35] control. This extra band migrated at the expected position for the
recombinant transferrin (N413Q, N611Q) secreted from control strain [pDB2536].
Strain A [pDB2506] culture supernatants appeared to contain a diffuse protein
band at the position expected for transferrin. This suggested that the secreted
recombinant transferrin was heterogeneous, possibly due to hyper-mannosylation
at Asn413 and/or Asn611.

EXAMPLE 4

Comparing transferrin secretion from S. cerevisiae control strain containing
pDB2711 with transferrin secretion from S. cerevisiae Strain A

Plasmid pDB2711 is as described above. Plasmid pDB2712 (Figure 22) was also
produced with the NotI cassette in the opposite direction to pDB2711.

Control strain S. cerevisiae [cir⁰] was transformed to leucine prototrophy with
pDB2711 and pDB2712. Transformants were selected on BMMD-agar and
cryopreserved trehalose stocks of control strain [pDB2711] were prepared.

Secretion of recombinant transferrin (N413Q, N611Q) by control strain
[pDB2711], control strain [pDB2712], Strain A [pDB2536], control strain
[pDB2536] and an alternative control strain [pDB2536] was compared in both
BMMD and YEPD shake flask culture. RIE indicated that a significant increase
in recombinant transferrin secretion had been achieved from control strain
[pDB2711] with multiple episomal PDI1 copies, compared to Strain A [pDB2536]
with two chromosomal copies of PDI1, and control strain [pDB2536] with a
single chromosomal copy of PDI1 gene (Figure 23). Control strain [pDB2711]
and control strain [pDB2712] appeared to secrete similar levels of rTf (N413Q,
N611Q) into the culture media. The levels of secretion were relatively consistent
between control strain [pDB2711] and control strain [pDB2712] transformants in
both BMMD and YEPD media, suggesting that plasmid stability was sufficient for
high-level transferrin secretion even under non-selective conditions. This is in
contrast to the previous published data in relation to recombinant PDGF-BB and
HSA where introduction of PDI1 into multicopy 2µm plasmids was shown to be
detrimental to the host.



Table 3: Recombinant transferrin titres from high cell density fermentations

                                     Supernatant (g.L⁻¹)
Strain                             GP-HPLC      SDS-PAGE
Control [pDB2536]                  0.5/0.4
Alternative control [pDB2536]      1.5/1.6      0.6
                                   0.9/0.9      0.4/0.4/0.5
Strain A [pDB2536]                 0.7          0.6
                                   0.6
Control [pDB2711]                  3.5          3.6
                                   3.4          2.7/3.1


Reducing SDS-PAGE analysis of transferrin secreted from control strain
[pDB2711], control strain [pDB2712], Strain A [pDB2536], control strain
[pDB2536] and alternative control strain [pDB2536] in BMMD shake flask
culture is shown in Figure 24. This shows an abundant protein band in all samples
from control strain [pDB2711] and control strain [pDB2712] at the position
expected for transferrin (N413Q, N611Q). The relative stain intensity of the
transferrin (N413Q, N611Q) band from the different strains suggested that Strain
A [pDB2536] produced more than control strain [pDB2536] and alternative
control strain [pDB2536], but that there was an even more dramatic increase in
secretion from control strain [pDB2711] and control strain [pDB2712]. The
increased recombinant transferrin secretion observed was concomitant with the
increased PDI1 copy number in these strains. This suggested that Pdi1p levels
were limiting transferrin secretion in control strain, Strain A and the alternative
control strain, and that elevated PDI1 copy number was responsible for increased
transferrin secretion. Elevated PDI1 copy number could increase the steady state
expression level of PDI1 so increasing the amount of Pdi1p activity. There are a
number of alternative methods by which this could be achieved without increasing
the copy number of the PDI1 gene, for example the steady state PDI1 mRNA
level could be increased by either increasing the transcription rate, say by use of a
higher efficiency promoter, or by reducing the clearance rate of the PDI1 mRNA.
Alternatively, protein engineering could be used to enhance the specific activity or
turnover number of the Pdi1p protein.

In high cell density fermentations control strain [pDB2711] recombinant
transferrin (N413Q, N611Q) production was measured at approximately 3g.L⁻¹ by
both GP-HPLC analysis and SDS-PAGE analysis (Table 3). This level of
production is several fold higher than control strain, the alternative control strain
or Strain A containing pDB2536. Furthermore, for the production of proteins for
therapeutic use in humans, expression systems such as control strain [pDB2711]
have advantages over those using Strain A, as they do not contain bacterial DNA
sequences.

CONCLUSIONS 

Secretion of recombinant transferrin from a multicopy expression plasmid
(pDB2536) was investigated in S. cerevisiae strains containing an additional copy
of the PDI1 gene integrated into the yeast genome. Transferrin secretion was also
investigated in S. cerevisiae transformed with a multicopy expression plasmid, in
which the PDI1 gene has been inserted into the multicopy episomal transferrin
expression plasmid (pDB2711).

A S. cerevisiae strain with an additional copy of the PDI1 gene integrated into the
genome at the endogenous PDI1 locus, secreted recombinant transferrin and non-
glycosylated recombinant transferrin (N413Q, N611Q) at an elevated level
compared to strains containing a single copy of PDI1. A further increase in PDI1
copy number was achieved by using pDB2711. In high cell density fermentation
of the strain transformed with pDB2711, recombinant transferrin (N413Q,
N611Q) was secreted at approximately 3g.L⁻¹, as measured by SDS-PAGE and
GP-HPLC analysis. Therefore, increased PDI1 gene copy number has produced a
large increase in the quantity of recombinant transferrins secreted from S.
cerevisiae.

The following conclusions are drawn:

1. In shake flask analysis of recombinant transferrin expression from pDB2536
(non-glycosylated transferrin (N413Q, N611Q)) and pDB2506 (glycosylated
transferrin), the S. cerevisiae strain Strain A secreted higher levels of both
recombinant transferrins into the culture supernatant than control strains. This
was attributed to the extra copy of PDI1 integrated at the PDI1 locus.

2. Control strain [pDB2711], which contained the PDI1 gene on the multicopy
expression plasmid, produced a several-fold increase in recombinant transferrin
(N413Q, N611Q) secretion compared to Strain A [pDB2536] in both shake flask
culture and high cell density fermentation.

3. Elevated PDI1 copy number in yeast such as S. cerevisiae will be
advantageous during the production of heterologous proteins, such as those from
the transferrin family.

4. pSAC35-based plasmids containing additional copies of PDI1 gene have
advantages for the production of proteins from the transferrin family, and their
derivatives, such as fusions, mutants, domains and truncated forms.

EXAMPLE 5

Insertion of a PDI1 gene into a 2µm-like plasmid increased secretion of
recombinant transferrin from various different S. cerevisiae strains

The S. cerevisiae strains JRY188 cir⁺ (National Collection of Yeast Cultures) and
MT302/28B cir⁺ (Finnis et al., 1993, Eur. J. Biochem., 212, 201-210) were cured of
the native 2µm plasmid by galactose induced over-expression of FLP from
YEp351-GAL-FLP1, as described by Rose and Broach (1990, Meth. Enzymol.,
185, 234-279) to create the S. cerevisiae strains JRY188 cir⁰ and MT302/28B cir⁰,
respectively.

The S. cerevisiae strains JRY188 cir⁰, MT302/28B cir⁰, S150-2B cir⁰ (Cashmore
et al., 1986, Mol. Gen. Genet., 203, 154-162), CB11-63 cir⁰ (Zealey et al., 1988,
Mol. Gen. Genet., 211, 155-159) were all transformed to leucine prototrophy with
pDB2931 (Figure 14) and pDB2929 (Figure 12). Transformants were selected on
appropriately supplemented minimal media lacking leucine. Transformants of
each strain were inoculated into 10mL YEPD in 50mL shake flasks and incubated
in an orbital shaker at 30°C, 200rpm for 4-days. Culture supernatants were
harvested and the recombinant transferrin titres compared by rocket
immunoelectrophoresis (Figure 26). The results indicated that the transferrin titres
in supernatants from all the yeast strains were higher when PDI1 was present in
the 2µm plasmid (pDB2929) than when it was not (pDB2931).

EXAMPLE 6

The construction of expression vectors containing various PDI1 genes and the
expression cassettes for various heterologous proteins on the same 2µm-like
plasmid

PCR amplification and cloning of PDI1 genes into YIplac211:

The PDI1 genes from S. cerevisiae S288c and S. cerevisiae SKQ2n were
amplified by PCR to produce DNA fragments with different lengths of the 5'-
untranslated region containing the promoter sequence. PCR primers were
designed to permit cloning of the PCR products into the EcoRI and BamHI sites of
YIplac211 (Gietz & Sugino, 1988, Gene, 74, 527-534). Additional restriction
endonuclease sites were also incorporated into PCR primers to facilitate
subsequent cloning. Table 4 describes the plasmids constructed and Table 5 gives
the PCR primer sequences used to amplify the PDI1 genes. Differences in the
PDI1 promoter length within these YIplac211-based plasmids are described in
Table 4.

pDB2939 (Figure 27) was produced by PCR amplification of the PDI1 gene from
S. cerevisiae S288c genomic DNA with oligonucleotide primers DS248 and
DS250 (Table 5), followed by digesting the PCR product with EcoRI and BamHI
and cloning the approximately 1.98-kb fragment into YIplac211 (Gietz & Sugino,
1988, Gene, 74, 527-534), that had been cut with EcoRI and BamHI. DNA
sequencing of pDB2939 identified a missing 'G' from within the DS248
sequence, which is marked in bold in Table 5. Oligonucleotide primers used for
sequencing the PDI1 gene are listed in Table 6, and were designed from the
published S288c PDI1 gene sequence (PDI1/YCL043C on chromosome III from
coordinates 50221 to 48653 plus 1000 basepairs of upstream sequence and 1000
basepairs of downstream sequence, http://www.yeastgenome.org/, GenBank
Accession number NC_001135).

Table 4: YIplac211-based Plasmids Containing PDI1 Genes

Plasmid   Plasmid Base   PDI1 Gene                                      PCR Primers
                         Source   Promoter           Terminator
pDB2939   YIplac211      S288c    Long (~210-bp)     -> Bsu36I          DS248+DS250
pDB2941   YIplac211      S288c    Medium (~140-bp)   -> Bsu36I          DS251+DS250
pDB2942   YIplac211      S288c    Short (~80-bp)     -> Bsu36I          DS252+DS250
pDB2943   YIplac211      SKQ2n    Long (~210-bp)     -> Bsu36I          DS248+DS250
pDB2963   YIplac211      SKQ2n    Medium (~140-bp)   -> Bsu36I          DS267+DS250
pDB2945   YIplac211      SKQ2n    Short (~80-bp)     -> Bsu36I          DS252+DS250



Table 5: Oligonucleotide Primers for PCR Amplification of S. cerevisiae PDI1
Genes

Primer   Sequence
DS248    5'-GTCAGAATTCGAGCTCTACGTATTAATTAAGGCCGGCCAGGCCCGGGCTAGT
            CTCTTTTTCCAATTTGCCACCGTGTAGCATTTTGTTGT-3'
DS249    5'-GTCAGGATCCTACGTACCCGGGGATATCATTATCATCTTTGTCGTGGTCATCT
            TGTGTG-3'
DS250    5'-GTCAGGATCCTACGTACCCGGGTAAGGCGTTCGTGCAGTGTGACGAATAT
            AGCG-3'
DS251    5'-GTCAGAATTCGAGCTCTACGTATTAATTAAGGCCGGCCAGGCCCGGGCCCGT
            ATGGACATACATATATATATATATATATATATATATTTTGTTACGCG-3'
DS252    5'-GTCAGAATTCGAGCTCTACGTATTAATTAAGGCCGGCCAGGCCCGGGCTTGTTG
            CAAGCAGCATGTCTAATTGGTAATTTTAAAGCTGCC-3'
DS267    5'-GTCAGAATTCGAGCTCTACGTATTAATTAAGGCCGGCCAGGCCCGGGCCCGTA
            TGGACATACATATATATATATATATATATATATATATATTTTGTTACGCG-3'



Table 6: Oligonucleotide Primers for DNA Sequencing S. cerevisiae PDI1 Genes

Primer                 Sequence
DS253                  5'-CCTCCCTGCTGCTCGCC-3'
DS254                  5'-CTGTAAGAACATGGCTCC-3'
DS255                  5'-CTCGATCGATTACGAGGG-3'
DS256                  5'-AAGAAAGCCGATATCGC-3'
DS257                  5'-CAACTCTCTGAAGAGGCG-3'
DS258                  5'-CAACGCCACATCCGACG-3'
DS259                  5'-GTAATTCTGATCACTTTGG-3'
DS260                  5'-GCACTTATTATTACTACGTGG-3'
DS261                  5'-GTTTTCCTTGATGAAGTCG-3'
DS262                  5'-GTGACCACACCATGGGGC-3'
DS263                  5'-GTTGCCGGCGTGTCTGCC-3'
DS264                  5'-TTGAAATCATCGTCTGCG-3'
DS265                  5'-CGGCAGTTCTAGGTCCC-3'
DS266                  5'-CCACAGCCTCTTGTTGGG-3'
M13/pUC Primer (-40)   5'-GTTTTCCCAGTCACGAC-3'

Plasmids pDB2941 (Figure 28) and pDB2942 (Figure 29) were constructed
similarly using the PCR primers described in Tables 4 and 5, and by cloning the
approximately 1.90-kb and 1.85-kb EcoRI-BamHI fragments, respectively, into
YIplac211. The correct DNA sequences were confirmed for the PDI1 genes in
pDB2941 and pDB2942.

The S. cerevisiae SKQ2n PDI1 gene sequence was PCR amplified from plasmid
DNA containing the PDI1 gene from pMA3a:C7 (US 6,291,205), also known as
Clone C7 (Crouzet & Tuite, 1987, supra; Farquhar et al., 1991, supra). The
SKQ2n PDI1 gene was amplified using oligonucleotide primers DS248 and
DS250 (Tables 4 and 5). The approximately 2.01-kb PCR product was digested
with EcoRI and BamHI and ligated into YIplac211 (Gietz & Sugino, 1988, Gene,
74, 527-534) that had been cut with EcoRI and BamHI, to produce plasmid
pDB2943 (Figure 30). The 5' end of the SKQ2n PDI1 sequence is analogous to a
blunt-ended SpeI-site extended to include the EcoRI, SacI, SnaBI, PacI, FseI,
SfiI and SmaI sites; the 3' end extends up to a site analogous to a blunt-ended
Bsu36I site, extended to include SmaI, SnaBI and BamHI sites. The PDI1
promoter length is approximately 210bp. The entire DNA sequence was
determined for the PDI1 fragment using oligonucleotide primers given in
Table 6. This confirmed the presence of a coding sequence for the PDI protein
of S. cerevisiae strain SKQ2n (NCBI accession number CAA38402), but with a
serine residue at position 114 (not an arginine residue as previously
published). As with the S. cerevisiae S288c sequence in pDB2939, pDB2943 also
had a missing 'G' from within the DS248 sequence, which is marked in bold in
Table 5.
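
The arrangement of restriction sites at the 5' end described above can be
checked against the primer itself. The following is a minimal sketch, not part
of the original method: it scans the first 47 nucleotides of DS248, copied
from Table 5 as printed, for the standard recognition sequences of the seven
enzymes named in the text. The function name and data layout are illustrative
assumptions.

```python
import re

# 5' tail of primer DS248 as printed in Table 5 (first 47 nt).
DS248_TAIL = "GTCAGAATTCGAGCTCTACGTATTAATTAAGGCCGGCCAGGCCCGGG"

# Standard recognition sequences (N = any base), in the order named in the text.
SITES = {
    "EcoRI": "GAATTC",
    "SacI": "GAGCTC",
    "SnaBI": "TACGTA",
    "PacI": "TTAATTAA",
    "FseI": "GGCCGGCC",
    "SfiI": "GGCCNNNNNGGCC",
    "SmaI": "CCCGGG",
}


def find_sites(seq, sites):
    """Return {enzyme: [0-based start positions]}, including overlapping hits."""
    hits = {}
    for name, motif in sites.items():
        # A lookahead lets overlapping occurrences be reported as well.
        pattern = "(?=(" + motif.replace("N", "[ACGT]") + "))"
        hits[name] = [m.start() for m in re.finditer(pattern, seq)]
    return hits


positions = find_sites(DS248_TAIL, SITES)
for enzyme, starts in positions.items():
    print(enzyme, starts)
```

Each site occurs once in this tail, and the start positions do not decrease
in the order the sites are listed in the text; the FseI and SfiI sites overlap
because both begin with GGCC.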

Plasmids pDB2963 (Figure 31) and pDB2945 (Figure 32) were constructed
similarly using the PCR primers described in Tables 4 and 5, and by cloning the
approximately 1.94-kb and 1.87-kb EcoRI-BamHI fragments, respectively, into
YIplac211. The expected DNA sequences were confirmed for the PDI1 genes in
pDB2963 and pDB2945, with a serine codon at the position of amino acid 114.
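
As a rough consistency check on the six YIplac211-based constructs, the
approximate fragment sizes quoted above can be compared with the approximate
promoter lengths in Table 4. The sketch below is illustrative only: it
assumes that, within one strain, the fragments differ only in promoter
length, so that fragment size minus promoter length should be roughly
constant. All figures are the rounded values given in the text.

```python
# (strain, approximate EcoRI-BamHI fragment size in bp, approximate promoter length in bp)
FRAGMENTS = {
    "pDB2939": ("S288c", 1980, 210),
    "pDB2941": ("S288c", 1900, 140),
    "pDB2942": ("S288c", 1850, 80),
    "pDB2943": ("SKQ2n", 2010, 210),
    "pDB2963": ("SKQ2n", 1940, 140),
    "pDB2945": ("SKQ2n", 1870, 80),
}


def non_promoter_length(fragment_bp, promoter_bp):
    """Length of the fragment that is not accounted for by the promoter."""
    return fragment_bp - promoter_bp


remainders = {}
for plasmid, (strain, fragment_bp, promoter_bp) in FRAGMENTS.items():
    remainders.setdefault(strain, []).append(
        non_promoter_length(fragment_bp, promoter_bp)
    )

for strain, values in remainders.items():
    print(strain, values, "spread:", max(values) - min(values), "bp")
```

Within each strain the remainders agree to within 10 bp, which is no more
than the rounding of the quoted sizes.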

The construction of pSAC35-based rHA expression plasmids with different
PDI1 genes inserted at the XcmI-site after REP2:

pSAC35-based plasmids were constructed for the co-expression of rHA with
different PDI1 genes (Table 7).

Table 7: pSAC35-based plasmids for co-expression of rHA with different PDI1
genes

Plasmid   Plasmid   PDI1 Gene at XcmI-site after REP2               Heterologous Protein
          Base      Source   Promoter   Terminator    Orientation   Expression Cassette
                                                                    (at NotI-site)
pDB2982   pSAC35    SKQ2n    Long       -> Bsu36I     A             rHA
pDB2983   pSAC35    SKQ2n    Long       -> Bsu36I     B             rHA
pDB2984   pSAC35    SKQ2n    Medium     -> Bsu36I     A             rHA
pDB2985   pSAC35    SKQ2n    Medium     -> Bsu36I     B             rHA
pDB2986   pSAC35    SKQ2n    Short      -> Bsu36I     A             rHA
pDB2987   pSAC35    SKQ2n    Short      -> Bsu36I     B             rHA
pDB2976   pSAC35    S288c    Long       -> Bsu36I     A             rHA
pDB2977   pSAC35    S288c    Long       -> Bsu36I     B             rHA
pDB2978   pSAC35    S288c    Medium     -> Bsu36I     A             rHA
pDB2979   pSAC35    S288c    Medium     -> Bsu36I     B             rHA
pDB2980   pSAC35    S288c    Short      -> Bsu36I     A             rHA
pDB2981   pSAC35    S288c    Short      -> Bsu36I     B             rHA

The rHA expression cassette from pDB2243 (Figure 33, as described in WO
00/44772) was first isolated on a 2,992-bp NotI fragment, which subsequently
was cloned into the NotI-site of pDB2688 (Figure 4) to produce pDB2693
(Figure 34). pDB2693 was digested with SnaBI, treated with calf intestinal
alkaline phosphatase, and ligated with SnaBI fragments containing the PDI1
genes from pDB2943, pDB2963, pDB2945, pDB2939, pDB2941 and pDB2942. This
produced plasmids pDB2976 to pDB2987 (Figures 35 to 46). PDI1 transcribed in
the same orientation as REP2 was designated "orientation A", whereas PDI1
transcribed in opposite orientation to REP2 was designated "orientation B"
(Table 7).
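
The twelve rHA co-expression plasmids form a full combination of two PDI1
sources, three promoter lengths and two orientations. The sketch below
enumerates that grid. It relies on an observation from Table 7 rather than on
any rule stated in the text: the plasmid numbers pDB2976 to pDB2987 happen to
run in the order source, promoter, orientation. The function name and the
`first_number` parameter are hypothetical.

```python
from itertools import product

SOURCES = ["S288c", "SKQ2n"]
PROMOTERS = ["Long", "Medium", "Short"]
ORIENTATIONS = ["A", "B"]  # A: same direction as REP2; B: opposite direction


def enumerate_constructs(first_number=2976):
    """Map plasmid names to (source, promoter, orientation).

    Assumes the sequential numbering seen in Table 7; this is an
    observed pattern, not a naming rule given in the source.
    """
    table = {}
    combos = product(SOURCES, PROMOTERS, ORIENTATIONS)
    for offset, combo in enumerate(combos):
        table[f"pDB{first_number + offset}"] = combo
    return table


constructs = enumerate_constructs()
for name, (source, promoter, orientation) in constructs.items():
    print(name, source, promoter, orientation)
```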

The construction of pSAC35-based transferrin expression plasmids with
different PDI1 genes inserted at the XcmI-site after REP2:

pSAC35-based plasmids were constructed for the co-expression of recombinant
transferrin (N413Q, N611Q) with different PDI1 genes (Table 8).

Table 8: pSAC35-based plasmids for co-expression of transferrin with different
PDI1 genes

Plasmid   Plasmid   PDI1 Gene at XcmI-site after REP2               Heterologous Protein
          Base      Source   Promoter   Terminator    Orientation   Expression Cassette
                                                                    (at NotI-site)
pDB2929   pSAC35    SKQ2n    Long       -> Bsu36I     A             rTf (N413Q, N611Q)
pDB3085   pSAC35    S288c    Long       -> Bsu36I     A             rTf (N413Q, N611Q)
pDB3086   pSAC35    S288c    Medium     -> Bsu36I     A             rTf (N413Q, N611Q)
pDB3087   pSAC35    S288c    Short      -> Bsu36I     A             rTf (N413Q, N611Q)

In order to achieve this, the NotI expression cassettes for rHA expression were
first deleted from pDB2976, pDB2978, and pDB2980 by NotI digestion and
circularisation of the vector backbone. This produced plasmids pDB3081 (Figure
47), pDB3083 (Figure 48) and pDB3084 (Figure 49) as described in Table 9.

Table 9: pSAC35-based plasmids with different PDI1 genes

Plasmid   Plasmid   PDI1 Gene at XcmI-site after REP2               Heterologous Protein
          Base      Source   Promoter   Terminator    Orientation   Expression Cassette
                                                                    (at NotI-site)
pDB2690   pSAC35    SKQ2n    Long       -> Bsu36I     A             None
pDB3081   pSAC35    S288c    Long       -> Bsu36I     A             None
pDB3083   pSAC35    S288c    Medium     -> Bsu36I     A             None
pDB3084   pSAC35    S288c    Short      -> Bsu36I     A             None



The 3,256-bp NotI fragment from pDB2928 (Figure 11) was cloned into the
NotI-sites of pDB3081, pDB3083 and pDB3084, such that transcription from the
transferrin gene was in the same direction as LEU2. This produced plasmids
pDB3085 (Figure 50), pDB3086 (Figure 51) and pDB3087 (Figure 52) as
described in Table 8.

EXAMPLE 7

Insertion and optimisation of a PDI1 gene in the 2µm-like plasmid increased
the secretion of recombinant human serum albumin by various different S.
cerevisiae strains

The S. cerevisiae strains JRY188 cir°, MT302/28B cir°, S150-2B cir°, CB11-63
cir° (all described above), AH22 cir° (Mead et al., 1986, Mol. Gen. Genet., 205,
417-421) and DS569 cir° (Sleep et al., 1991, Bio/Technology, 9, 183-187) were
transformed to leucine prototrophy with either pDB2244 (WO 00/44772),
pDB2976 (Figure 35), pDB2978 (Figure 37) or pDB2980 (Figure 39) using a
modified lithium acetate method (Sigma yeast transformation kit, YEAST-1,
protocol 2; (Ito et al., 1983, J. Bacteriol., 153, 163; Elble, 1992,
Biotechniques, 13, 18)). Transformants were selected on BMMD-agar plates with
appropriate supplements, and were subsequently patched out on BMMD-agar
plates with appropriate supplements.

Transformants of each strain were inoculated into 10mL YEPD in 50mL shake
flasks and incubated in an orbital shaker at 30°C, 200rpm for 4 days. Culture
supernatants were harvested and the recombinant albumin titres compared by
rocket immunoelectrophoresis (Figures 53 and 54). The results indicated that
the albumin titres in the culture supernatants from all the yeast strains were
higher when PDI1 was present in the 2µm plasmid than when it was not
(pDB2244). The albumin titre in the culture supernatants in the absence of
PDI1 on the plasmid was dependent upon which yeast strain was selected as the
expression host; however, in most examples tested the largest increase in
expression was observed when PDI1 with the long promoter (~210-bp) was
present in the 2µm plasmid (pDB2976). Modifying the PDI1 promoter by
shortening, for example to delete regulation sites, had the effect of
controlling the improvement. For one yeast strain, known to be a high rHA
producing strain (DS569), a shorter promoter was preferred for optimal
expression.

EXAMPLE 8

Different PDI1 genes enhanced the secretion of recombinant transferrin when
co-expressed on a 2µm-based plasmid.

The secretion of recombinant transferrin (N413Q, N611Q) was investigated with
co-expression of the S. cerevisiae SKQ2n PDI1 gene with the long promoter
(~210-bp), and the S. cerevisiae S288c PDI1 with the long, medium and short
promoters (~210 bp, ~140 bp and ~80 bp respectively).

The same Control Strain as used in previous examples (e.g. Example 2) was
transformed to leucine prototrophy with pDB2931 (negative control plasmid
without PDI1) and pDB2929, pDB3085, pDB3086 and pDB3087 (Table 8).
Transformants were selected on BMMD-agar plates and five colonies selected for
analysis. Strains were grown in 10mL BMMD and 10mL YEPD shake flask
cultures for 4 days at 30°C, 200rpm and culture supernatants harvested for
analysis by rocket immunoelectrophoresis (RIE).

Figure 55 shows that in minimal media (BMMD) the S. cerevisiae SKQ2n PDI1
gene with the long promoter gave the highest rTf (N413Q, N611Q) titres. The S.
cerevisiae S288c PDI1 gene gave lower rTf (N413Q, N611Q) titres, which
decreased further as the PDI1 promoter length was shortened.

Figure 56 shows that in rich media (YEPD) the S. cerevisiae SKQ2n PDI1 and S.
cerevisiae S288c PDI1 genes with the long promoters gave similar rTf (N413Q,
N611Q) production levels. Also, the shorter the promoter length of the S.
cerevisiae S288c PDI1 gene, the lower was the rTf (N413Q, N611Q) production
level.

EXAMPLE 9

PDI1 on the 2µm-based plasmid enhanced the secretion of recombinant
albumin fusions.

The effect of co-expression of the S. cerevisiae SKQ2n PDI1 gene with the long
promoter (~210-bp) upon the expression of recombinant albumin fusions was
investigated.

The construction of a NotI N-terminal endostatin-albumin expression cassette
(pDB2556) has been previously described (WO 03/066085). Appropriate yeast
vector sequences were provided by a "disintegration" plasmid pSAC35 generally
disclosed in EP-A-286 424 and described by Sleep, D., et al., 1991,
Bio/Technology, 9, 183-187. The 3.54kb NotI N-terminal endostatin-albumin
expression cassette was isolated from pDB2556, purified and ligated into NotI
digested pSAC35, which had been treated with calf intestinal phosphatase,
creating plasmid pDB3099 containing the NotI expression cassette in the same
orientation as the LEU2 selection marker (Figure 57). Appropriate yeast PDI1
vector sequences were provided by a "disintegration" plasmid pDB2690 (Figure 6).
The 3.54kb NotI N-terminal endostatin-albumin expression cassette was isolated
from pDB2556, purified and ligated into NotI digested pDB2690, which had been
treated with calf intestinal phosphatase, creating plasmid pDB3100 containing
the NotI expression cassette in the same orientation as the LEU2 selection
marker (Figure 58).

The construction of a NotI N-terminal angiostatin-albumin expression cassette
(pDB2556) has been previously described (WO 03/066085), as has the
construction of a pSAC35-based yeast expression vector, pDB2765 (Figure 59).
The 3.77kb NotI N-terminal angiostatin-albumin expression cassette was isolated
from pDB2556, purified and ligated into NotI digested pDB2690, an appropriate
yeast PDI1 expression vector, which had been treated with calf intestinal
phosphatase, creating plasmid pDB3107 containing the NotI expression cassette
in the same orientation as the LEU2 selection marker (Figure 60).

The construction of a NotI N-terminal Kringle5-(GGS)4GG-albumin expression
cassette (pDB2771) has been previously described (WO 03/066085), as has the
construction of a pSAC35-based yeast expression vector, pDB2773 (Figure 61).
The 3.27kb NotI N-terminal Kringle5-(GGS)4GG-albumin expression cassette was
isolated from pDB2771, purified and ligated into NotI digested pDB2690, an
appropriate yeast PDI1 expression vector, which had been treated with calf
intestinal phosphatase, creating plasmid pDB3104 containing the NotI expression
cassette in the same orientation as the LEU2 selection marker (Figure 62).

The construction of a NotI N-terminal DX-890-(GGS)4GG-albumin expression
cassette (pDB2683) has been previously described (WO 03/066824). Appropriate
yeast vector sequences were provided by the "disintegration" plasmid pSAC35.
The 3.20kb NotI N-terminal DX-890-(GGS)4GG-albumin expression cassette was
isolated from pDB2683, purified and ligated into NotI digested pSAC35, which
had been treated with calf intestinal phosphatase, creating plasmid pDB3101
containing the NotI expression cassette in the same orientation as the LEU2
selection marker (Figure 63). Appropriate yeast PDI1 vector sequences were
provided by a "disintegration" plasmid pDB2690 (Figure 6). The 3.20kb NotI
N-terminal DX-890-(GGS)4GG-albumin expression cassette was isolated from
pDB2683, purified and ligated into NotI digested pDB2690, which had been
treated with calf intestinal phosphatase, creating plasmid pDB3102 containing
the NotI expression cassette in the same orientation as the LEU2 selection
marker (Figure 64).

The construction of a NotI N-terminal DPI-14-(GGS)4GG-albumin expression
cassette (pDB2666) has been previously described (WO 03/066824), as has the
construction of a pSAC35-based yeast expression vector, pDB2679 (Figure 65).

The 3.21kb NotI N-terminal DPI-14-(GGS)4GG-albumin expression cassette was
isolated from pDB2666, purified and ligated into NotI digested pDB2690, an
appropriate yeast PDI1 expression vector, which had been treated with calf
intestinal phosphatase, creating plasmid pDB3103 containing the NotI expression
cassette in the same orientation as the LEU2 selection marker (Figure 66).

CNTF was cloned from human genomic DNA by amplification of the two exons
using the following primers for exon 1 and exon 2, respectively, using standard
conditions.

Exon 1 primers:
5'-CTCGGTACCCAGCTGACTTGTTTCCTGG-3'; and
5'-ATAGGATTCCGTAAGAGCAGTCAG-3'

Exon 2 primers:
5'-GTGAAGCATCAGGGCCTGAAC-3'; and
5'-CTCTCTAGAAGCAAGGAAGAGAGAAGGGAC-3'

Both fragments were ligated under standard conditions, before being re-amplified
by PCR using primers 5'-CTCGGTACCCAGCTGACTTGTTTCCTGG-3' and
5'-CTCTCTAGAAGCAAGGAAGAGAGAAGGGAC-3' and cloned into vector
pCR4 (Invitrogen). To generate Axokine™ (as disclosed in Lambert et al., 2001,
PNAS, 98, 4652-4657) site-directed mutagenesis was employed to introduce C17A
(TGT->GCT) and Q63R (CAG->AGA) mutations. DNA sequencing also
revealed the presence of a silent T->C substitution V85V (GTT->GTC) as
described in WO 2004/015113.
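
The codon changes quoted for the Axokine™ mutations can be checked against
the standard genetic code. The sketch below is illustrative only: it builds
the standard codon table and confirms that each quoted codon pair encodes the
residues named in the mutation label. The helper name `check_change` is
hypothetical.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# (label, original codon, altered codon) as quoted in the text.
CHANGES = [
    ("C17A", "TGT", "GCT"),
    ("Q63R", "CAG", "AGA"),
    ("V85V", "GTT", "GTC"),
]


def check_change(label, before, after):
    """True if the codons encode the first and last residues of the label."""
    wild_type, mutant = label[0], label[-1]
    return CODON_TABLE[before] == wild_type and CODON_TABLE[after] == mutant


results = {label: check_change(label, b, a) for label, b, a in CHANGES}
print(results)
```

All three labels agree with their codons, and V85V is silent because GTT and
GTC both encode valine.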

The Axokine™ cDNA was amplified by PCR using single stranded
oligonucleotides MH33 and MH36 to create an approximate 0.58kbp PCR
fragment.

MH33
5'-ATGCAGATCTTTGGATAAGAGAGCTTTCACAGAGCATTCACCGCTGACCCC-3'

MH36
5'-CACCGGATCCACCCCCAGTCTGATGAGAAGAAATGAAACGAAGGTCATGG-3'

This was achieved with FastStart Taq DNA polymerase (Roche) in a 50µL
reaction, which was initiated by a 4-minute incubation at 95°C and followed by
25 cycles of PCR (95°C for 30secs, 55°C for 30secs, 72°C for 60sec). A PCR
product of the expected size was observed in a 10µL sample following
electrophoresis in an ethidium bromide stained 1% agarose gel. The remaining
PCR product was purified using a QIAquick PCR purification kit (Qiagen) and
digested to completion with BamHI and BglII. DNA of approximately the
expected size was excised from an ethidium bromide stained 1% (w/v) agarose gel
and purified.

Plasmid pDB2573X provided a suitable transcription promoter and terminator,
along with a suitable secretory leader sequence and DNA sequences encoding part
of a (GGS)4GG peptide linker fused to the N-terminus of human albumin. The
construction of pDB2573X has been previously described (WO 03/066824).

The 0.57kb BamHI and BglII digested PCR product was ligated with pDB2573X,
which had been digested with BamHI and BglII and treated with calf intestinal
alkaline phosphatase, to create plasmid pDB2617 (Figure 95), and the correct
DNA sequence was confirmed for the PCR generated fragment and adjacent
sequences using oligonucleotide primers CF84, CF85, PRB and DS229.

CF84
5'-CCTATGTGAAGCATCAGGGC-3'

CF85
5'-CCAACATTAATAGGCATCCC-3'

PRB
5'-CGTCCCGTTATATTGGAG-3'

DS229
5'-CTTGTCACAGTTTTCAGCAGATTCGTCAG-3'

Plasmid pDB2617 was digested with NdeI and NotI, and the 3.586-kb NotI
expression cassette for Axokine™-(GGS)4GG-albumin secretion was purified
from an agarose gel.

Appropriate yeast vector sequences were provided by the "disintegration" plasmid
pSAC35. The 3.586kb NotI N-terminal Axokine™-(GGS)4GG-albumin
expression cassette was isolated from pDB2617, purified and ligated into NotI
digested pSAC35, which had been treated with calf intestinal phosphatase,
creating plasmid pDB2618 containing the NotI expression cassette in the same
orientation as the LEU2 selection marker (Figure 96). Appropriate yeast PDI1
vector sequences were provided by a "disintegration" plasmid pDB2690 (Figure 6).
The 3.586kb NotI N-terminal Axokine™-(GGS)4GG-albumin expression cassette
was isolated from pDB2617, purified and ligated into NotI digested pDB2690,
which had been treated with calf intestinal phosphatase, creating plasmid
pDB3106 containing the NotI expression cassette in the same orientation as the
LEU2 selection marker (Figure 68).

A human IL10 cDNA (NCBI accession number NM_000572) was amplified by
PCR using single stranded oligonucleotides CF68 and CF69.

CF68
5'-GCGCAGATCTTTGGATAAGAGAAGCCCAGGCCAGGGCACCCAGTCTGAGAACAGCTGCAC-3'

CF69
5'-GCTTGGATCCACCGTTTCGTATCTTCATTGTCATGTAGGCTTCTATGTAG-3'

The 0.43kb DNA fragment was digested to completion with BamHI and partially
digested with BglII and the 0.42kb BglII-BamHI DNA fragment isolated.

Plasmid pDB2573X provided a suitable transcription promoter and terminator,
along with a suitable secretory leader sequence and DNA sequences encoding part
of a (GGS)4GG peptide linker fused to the N-terminus of human albumin. The
construction of pDB2573X has been previously described (WO 03/066824).

Plasmid pDB2573X was digested to completion with BglII and BamHI, the 6.21kb
DNA fragment was isolated and treated with calf intestinal phosphatase and then
ligated with the 0.42kb BglII-BamHI N-terminal IL10 cDNA to create pDB2620
(Figure 69). Appropriate yeast vector sequences were provided by the
"disintegration" plasmid pSAC35. The 3.51kb NotI N-terminal
IL10-(GGS)4GG-albumin expression cassette was isolated from pDB2620, purified
and ligated into NotI digested pSAC35, which had been treated with calf
intestinal phosphatase, creating plasmid pDB2621 containing the NotI expression
cassette in the same orientation as the LEU2 selection marker (Figure 70).
Appropriate yeast PDI1 vector sequences were provided by a "disintegration"
plasmid pDB2690 (Figure 6). The 3.51kb NotI N-terminal IL10-(GGS)4GG-albumin
expression cassette was isolated from pDB2620, purified and ligated into NotI
digested pDB2690, which had been treated with calf intestinal phosphatase,
creating plasmid pDB3105 containing the NotI expression cassette in the same
orientation as the LEU2 selection marker (Figure 71).

The same control yeast strain as used in previous examples was transformed to
leucine prototrophy using a modified lithium acetate method (Sigma yeast
transformation kit, YEAST-1, protocol 2; (Ito et al., 1983, J. Bacteriol., 153,
163; Elble, 1992, Biotechniques, 13, 18)). Transformants were selected on
BMMD-agar plates, and were subsequently patched out on BMMD-agar plates.
Cryopreserved trehalose stocks were prepared from 10mL BMMD shake flask
cultures (24 hrs, 30°C, 200rpm).

Transformants of each strain were inoculated into 10mL BMMD in 50mL shake
flasks and incubated in an orbital shaker at 30°C, 200rpm for 4 days. Culture
supernatants were harvested and the recombinant albumin fusion titres compared
by rocket immunoelectrophoresis (Figure 72). The results indicated that the
albumin fusion titre in the culture supernatants from the yeast strain was
higher when PDI1 was present in the 2µm plasmid than when it was not.

The increase in expression of the albumin fusions detected by rocket
immunoelectrophoresis was further studied by SDS-PAGE analysis. BMMD
shake flask cultures of YBX7 expressing various albumin-fusions were grown for
4 days in an orbital shaker at 30°C, 200rpm. A sample of the culture supernatant
was analysed by SDS-PAGE (Figure 73). A protein band of the expected size for
the albumin fusion under study was observed to increase in abundance.

EXAMPLE 10

Co-expression of S. cerevisiae ORM2 and recombinant transferrin on a
2µm-based plasmid

The ORM2 gene from S. cerevisiae S288c was cloned into the XcmI-site after
REP2 on a pSAC35-based plasmid containing an expression cassette for rTf
(N413Q, N611Q) at the NotI-site in the UL-region.

Plasmid pDB2965 (Figure 74) was constructed by insertion of the 3,256-bp NotI
fragment containing the rTf (N413Q, N611Q) expression cassette from pDB2928
(Figure 11) into the NotI-site of pDB2688 (Figure 4). pDB2688 was linearised by
NotI digestion and was treated with alkaline phosphatase. The rTf expression
cassette from pDB2928 was cloned into the NotI site of pDB2688 to produce
pDB2965, with the transferrin gene transcribed in the same direction as LEU2.

The ORM2 gene was amplified from S. cerevisiae S288c genomic DNA by PCR
with oligonucleotide primers GS11 and GS12 (Table 10) using the Expand High
Fidelity PCR System (Roche).

Table 10: Oligonucleotide Primers for PCR Amplification of S. cerevisiae
Chaperones

Primer   Description          Oligonucleotide Sequence
GS11     ORM2 primer, 54mer   5'-GCGCTACGTATTAATTAAATTGCTCATATATA
                                 GTGGGGGGGAATACTCATGCTG-3'
GS12     ORM2 primer, 49mer   5'-GCGCTACGTAGGCCGGCCAGAGAATATAAAGAA
                                 AGATGATGATGTAAGG-3'
CED037   SSA1 primer, 70mer   5'-ATACGCGCATGCGAATAATTTTTTTTTGCCTATC
                                 TATAAAATTAAAGTAGCAGTACTTCAACCATTAGTG-3'
CED038   SSA1 primer, 50mer   5'-ATACGCGCATGCCGACAAATTGTTACGTTGTGCTTTG
                                 ATTTCTAAAGCGC-3'
CED009   PSE1 primer, 50mer   5'-ATAGCGGGATCCAAGCTTCGACACATACATAATAACT
                                 CGATAAGGTATGG-3'
CED010   PSE1 primer, 39mer   5'-TATCGCGGATCCCGTCTTCACTGTACATTACACAT
                                 AAGC-3'
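
Because Table 10 states the length of every primer, the sequences can be
checked against those lengths, which is a useful guard against transcription
errors. The sketch below is illustrative only. Each sequence is entered as
the two lines on which it is printed, and one ambiguous character in GS12 is
read as T on the assumption that it completes the SnaBI site (TACGTA)
described in the text.

```python
# name: (stated length, sequence as the two printed lines of Table 10)
PRIMERS = {
    "GS11": (54, "GCGCTACGTATTAATTAAATTGCTCATATATA"
                 "GTGGGGGGGAATACTCATGCTG"),
    # Assumption: the ambiguous base in the printed GS12 sequence is T.
    "GS12": (49, "GCGCTACGTAGGCCGGCCAGAGAATATAAAGAA"
                 "AGATGATGATGTAAGG"),
    "CED037": (70, "ATACGCGCATGCGAATAATTTTTTTTTGCCTATC"
                   "TATAAAATTAAAGTAGCAGTACTTCAACCATTAGTG"),
    "CED038": (50, "ATACGCGCATGCCGACAAATTGTTACGTTGTGCTTTG"
                   "ATTTCTAAAGCGC"),
    "CED009": (50, "ATAGCGGGATCCAAGCTTCGACACATACATAATAACT"
                   "CGATAAGGTATGG"),
    "CED010": (39, "TATCGCGGATCCCGTCTTCACTGTACATTACACAT"
                   "AAGC"),
}


def length_report(primers):
    """Return {name: (stated length, actual length, match?)}."""
    return {
        name: (stated, len(seq), stated == len(seq))
        for name, (stated, seq) in primers.items()
    }


report = length_report(PRIMERS)
for name, (stated, actual, ok) in report.items():
    print(f"{name}: stated {stated}, counted {actual}, {'OK' if ok else 'MISMATCH'}")
```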



Primers were designed to incorporate SnaBI and PacI restriction recognition
sites at the 5' end of the forward primer and SnaBI and FseI restriction
recognition sites at the 5' end of the reverse primer for cloning into the
linker at the XcmI-site of the vector, pDB2965. PCR was carried out under the
following conditions: 200 µM dNTP mix, 2.5 U of Expand HiFi enzyme blend, 1 x
Expand HiFi reaction buffer, 0.8 µg genomic DNA; 1 cycle of 94°C for 2 minutes,
30 cycles of 94°C for 30 seconds, 55°C for 30 seconds, 72°C for 3 minutes, and
1 cycle of 72°C for 7 minutes. 0.4 µM of each primer was used. The required
1,195-bp PCR product and the pDB2965 vector were digested with PacI and FseI,
ligated together and transformed into competent E. coli DH5α cells. Ampicillin
resistant transformants were selected. ORM2-containing constructs were
identified by restriction enzyme analysis of plasmid DNA isolated from the
ampicillin resistant clones. Four plasmid clones were prepared, pDB3090,
pDB3091, pDB3092 and pDB3093, all of which had the same expected DNA fragment
pattern during restriction analysis (Figure 75).
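
The cycling conditions above can be written down as a small data structure,
which makes the total programmed time easy to work out. The sketch below is a
hedged illustration: the representation and function name are assumptions,
and only hold times are summed, so ramping between temperatures is ignored.

```python
# Each stage is (number of cycles, [(temperature in °C, hold time in seconds), ...]).
# Values follow the ORM2 PCR conditions quoted in the text.
ORM2_PROGRAM = [
    (1, [(94, 120)]),
    (30, [(94, 30), (55, 30), (72, 180)]),
    (1, [(72, 420)]),
]


def total_hold_seconds(program):
    """Sum the programmed hold times, ignoring ramp times between steps."""
    return sum(
        cycles * sum(seconds for _, seconds in steps)
        for cycles, steps in program
    )


total = total_hold_seconds(ORM2_PROGRAM)
print(f"{total} s = {total / 60:.0f} min of programmed hold time")
```

For these conditions the hold times add up to 7740 s, or 129 minutes.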

The S. cerevisiae Control Strain and Strain A (as described in Example 3) were
selected to investigate the effect on transferrin secretion when the transferrin
and ORM2 genes were co-expressed from the 2µm-based plasmids. The Control
Strain and Strain A were transformed to leucine prototrophy by plasmids
pDB3090, pDB3092 and pDB3093, as well as a control plasmid pDB2931 (Figure
14), containing the rTf (N413Q, N611Q) expression cassette without ORM2.
Transformants were selected on BMMD agar and patched out on BMMD agar for
subsequent analysis.

To investigate the effect of ORM2 co-expression on transferrin secretion, 10mL
selective (BMMD) and non-selective (YEPD) liquid media were inoculated with
strains containing the ORM2/transferrin co-expression plasmids. The shake flask
culture was then incubated at 30°C with shaking (200 rpm) for 4 days. The
relative level of transferrin secretion was determined by rocket gel
immunoelectrophoresis (RIE) (Figure 76).

Levels of transferrin secreted from Control Strain [pDB3090] and Control Strain
[pDB3092] were greater than the levels from Control Strain [pDB2931] in both
BMMD and YEPS media. Similarly, the levels of transferrin secreted from both
Strain A [pDB3090] and Strain A [pDB3093] were greater than the levels from
Strain A [pDB2931] in both BMMD and YEPS media. Transferrin secretion from
all Strain A transformants was higher than the Control Strain transformants grown
in the same media. Strain A contains an additional copy of PDI1 in the genome,
which enhanced transferrin secretion. Therefore in Strain A, the increased
expression of ORM2 and PDI1 had a cumulative effect on the secretion of
transferrin.

EXAMPLE 11

Co-expression of S. cerevisiae PSE1 and recombinant transferrin on a
2µm-based plasmid

The PSE1 gene from S. cerevisiae S288c was cloned into the XcmI-site after
REP2 on a pSAC35-based plasmid containing an expression cassette for rTf
(N413Q, N611Q) at the NotI-site in the UL-region.

The 3.25-kb wild-type PSE1 gene was amplified from S. cerevisiae S288c
genomic DNA by PCR with oligonucleotide primers CED009 and CED010 (Table
10) using the Expand High Fidelity PCR Kit (Roche). Primers were designed to
incorporate BamHI restriction recognition sites at the 5' end to facilitate
cloning into the vector, pUC19. PCR was carried out under the following
conditions: 1 cycle of 94°C for 2 minutes; 10 cycles of 94°C for 15 seconds,
45°C for 30 seconds, 68°C for 4 minutes and 30 seconds; 20 cycles of 94°C for
15 seconds, 45°C for 30 seconds, 68°C for 4 minutes and 30 seconds (increasing
5 seconds per cycle); and 1 cycle of 68°C for 10 minutes. The required PCR
product was digested with BamHI then ligated into pUC19, which had been
digested with BamHI and treated with alkaline phosphatase, producing construct
pDB2848 (Figure 77). Sequencing of pDB2848 confirmed that amplified sequences
were as expected for S. cerevisiae S288c PSE1, when compared to the sequence
from PSE1/YMR308C on chromosome XIII from coordinates 892220 to 888951 plus
1000 basepairs of upstream sequence and 1000 basepairs of downstream sequence
(Saccharomyces Genome Database at http://www.yeastgenome.org/). The PSE1
gene was then excised from pDB2848 by BamHI digestion, and the resulting
4,096-bp fragment phenol:chloroform extracted, ethanol precipitated and treated
with DNA polymerase Klenow fragment to fill in the 5'-overhang. Plasmid
pDB2965 (Figure 74) was linearised by SnaBI digestion, and alkaline phosphatase
treated. The linearised pDB2965 vector and the PSE1 insert were ligated, and
transformed into competent E. coli DH5α cells. Ampicillin resistant
transformants were selected. Plasmids pDB3097 (Figure 78) and pDB3098
(Figure 79) were identified to contain the PSE1 gene by restriction enzyme
analysis of plasmid DNA isolated from the ampicillin resistant clones. In
pDB3097 the PSE1 gene is transcribed in the same orientation as REP2, whereas
in pDB3098 the PSE1 gene is transcribed in the opposite orientation to REP2.
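
The second block of the PSE1 programme lengthens the 68°C extension by 5
seconds per cycle. The sketch below lists the resulting extension times. It
assumes that the first of the 20 cycles uses the base time of 4 minutes 30
seconds and that each later cycle adds a further 5 seconds; the text does not
say at which cycle the increment starts, so this starting point is an
assumption, and the function name is hypothetical.

```python
def extension_times(base_seconds, cycles, increment):
    """Extension time per cycle, assuming cycle 1 uses the base time."""
    return [base_seconds + increment * i for i in range(cycles)]


# 20 cycles, extension starting at 4 min 30 s (270 s), +5 s per cycle.
times = extension_times(base_seconds=270, cycles=20, increment=5)
total_extension = sum(times)

print("first cycle:", times[0], "s")
print("last cycle:", times[-1], "s")
print("total extension time:", total_extension, "s")
```

Under this assumption the last cycle extends for 365 s and the 20 extension
steps total 6350 s.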

The S. cerevisiae Control Strain was transformed to leucine prototrophy by
plasmids pDB3097 and pDB3098, as well as a control plasmid pDB2931 (Figure
14), containing the rTf (N413Q, N611Q) expression cassette without PSE1.
Transformants were selected on BMMD agar and patched out on BMMD agar for
subsequent analysis.

To investigate the effect of PSE1 expression on transferrin secretion, flasks
containing 10mL selective (BMMD) liquid media were inoculated with strains
containing the PSE1/transferrin co-expression plasmids. The shake flask cultures
were then incubated at 30°C with shaking (200 rpm) for 4 days. The relative level
of transferrin secretion was determined by rocket gel immunoelectrophoresis (RIE)
(Figure 80).

Levels of transferrin secreted from Control Strain [pDB3097] and Control Strain
[pDB3098] were greater than the levels from Control Strain [pDB2931] in
BMMD media. Therefore, expression of PSE1 from the 2μm-based plasmids had
enhanced transferrin secretion from S. cerevisiae. Transferrin secretion was
improved with the PSE1 gene transcribed in either direction relative to the REP2
gene in pDB3097 and pDB3098.

EXAMPLE 12 

Co-expression of S. cerevisiae SSA1 and recombinant transferrin on a
2μm-based plasmid

The SSA1 gene from S. cerevisiae S288c was cloned into the XcmI-site after REP2
on a pSAC35-based plasmid containing an expression cassette for rTf (N413Q,
N611Q) at the NotI-site in the UL-region.

The 1.93-kb SSA1 gene was amplified from S. cerevisiae S288c genomic DNA by
PCR with oligonucleotide primers CED037 and CED038 (Table 10) using the
Expand High Fidelity PCR Kit (Roche). Primers were designed to incorporate
SphI restriction recognition sites at their 5' ends to facilitate cloning into the
vector, pUC19. PCR was carried out under the following conditions: 1 cycle of
94°C for 10 minutes, 35 cycles of 94°C for 1 minute, 55°C for 1 minute, 72°C for
5 minutes, and 1 cycle of 72°C for 10 minutes. The required PCR product was
digested with SphI then ligated into pUC19, which had been digested with SphI
and treated with alkaline phosphatase, producing construct pDB2850 (Figure 81).
Sequencing of pDB2850 confirmed the expected sequence of S. cerevisiae S288c
SSA1/YAL005C on chromosome I from coordinates 141433 to 139505 plus 1000
basepairs of upstream sequence and 1000 basepairs of downstream sequence
published in the Saccharomyces Genome Database (http://www.yeastgenome.org/).
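
As a quick arithmetic check on the coordinates quoted above, the following minimal sketch computes the inclusive length of a feature from its start and end coordinates. It assumes the coordinates are 1-based and inclusive; the helper name `feature_length` is illustrative only and is not part of the original disclosure. The PSE1 coordinates are those quoted earlier in the text for PSE1/YMR308C.

```python
def feature_length(start: int, end: int) -> int:
    """Inclusive length in bp of a feature, assuming 1-based coordinates.

    Works for either strand: a reverse-strand gene is quoted with
    start > end, as in the text.
    """
    return abs(start - end) + 1


# Coordinates as quoted in the text.
ssa1_bp = feature_length(141433, 139505)  # SSA1/YAL005C, chromosome I
pse1_bp = feature_length(892220, 888951)  # PSE1/YMR308C, chromosome XIII

print(f"SSA1 coding region: {ssa1_bp} bp ({ssa1_bp / 1000:.2f} kb)")
print(f"PSE1 coding region: {pse1_bp} bp ({pse1_bp / 1000:.2f} kb)")
```

Under that assumption the SSA1 interval rounds to the 1.93 kb quoted for the amplified gene.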

The SSA1 gene was excised from pDB2850 by SphI-digestion, and the resulting
2,748-bp fragment phenol:chloroform extracted, ethanol precipitated and treated
with T4 DNA polymerase to remove the 3'-overhang. Plasmid pDB2965 was
linearised by SnaBI digestion and treated with calf alkaline phosphatase. The
linearised pDB2965 vector and the SSA1 insert were ligated and transformed into
competent E. coli DH5α cells. Ampicillin resistant transformants were selected.
SSA1 constructs pDB3094 (Figure 82) and pDB3095 (Figure 83) were identified
by restriction enzyme analysis of plasmid DNA isolated from the ampicillin
resistant clones. In pDB3094, the SSA1 gene is transcribed in the same direction
as REP2, whereas in pDB3095 the SSA1 gene is transcribed in the opposite
direction to REP2.

The S. cerevisiae Control Strain was transformed to leucine prototrophy by
plasmids pDB3094 and pDB3095, as well as a control plasmid pDB2931 (Figure
14), containing the rTf (N413Q, N611Q) expression cassette without SSA1.
Transformants were selected on BMMD agar and patched out on BMMD agar for
subsequent analysis.

To investigate the effect of SSA1 expression on transferrin secretion, flasks
containing 10mL selective (BMMD) liquid media were inoculated with strains
containing the SSA1/transferrin co-expression plasmids. The shake flask cultures
were incubated at 30°C with shaking (200 rpm) for 4 days. The relative level of
transferrin secretion was determined by rocket gel immunoelectrophoresis (RIE)
(Figure 84).

Levels of transferrin secreted from Control Strain [pDB3095] were greater than
the levels from Control Strain [pDB2931] and Control Strain [pDB3094] in
BMMD media. Therefore, expression of SSA1 from the 2μm-based plasmids had
enhanced transferrin secretion from S. cerevisiae. Transferrin secretion was
improved with the SSA1 gene transcribed in the opposite direction relative to the
REP2 gene, as in pDB3095.

EXAMPLE 13 

PDI1 gene disruption, combined with a PDI1 gene on the 2μm-based plasmid
enhanced the secretion of recombinant albumin and plasmid stability.

Single stranded oligonucleotide DNA primers listed in Table 11 were designed to
amplify a region upstream of the yeast PDI1 coding region and another region
downstream of the yeast PDI1 coding region.



Table 11: Oligonucleotide primers

Primer   Description             Sequence
DS299    5' PDI1 primer, 38mer   5'-CGTAGCGGCCGCCTGAAAGGGGTTGACCGTCCGTCGGC-3'
DS300    5' PDI1 primer, 40mer   5'-CGTAAAGCTTCGCCGCCCGACAGGGTAACATATTATCAC-3'
DS301    3' PDI1 primer, 38mer   5'-CGTAAAGCTTGACCACGTAGTAATAATAAGTGCATGGC-3'
DS302    3' PDI1 primer, 41mer   5'-CGTACTGCAGATTGGATAGTGATTAGAGTGTATAGTCCCGG-3'
DS303    18mer                   5'-GGAGCGACAAACCTTTCG-3'
DS304    20mer                   5'-ACCGTAATAAAAGATGGCTG-3'
DS305    24mer                   5'-CATCTTGTGTGTGAGTATGGTCGG-3'
DS306    14mer                   5'-CCCAGGATAATTTTCAGG-3'
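
The primer entries in Table 11 can be sanity-checked mechanically. The sketch below, offered only as an illustration, counts the length of each sequence as transcribed in this text and looks for the recognition sequences of the enzymes later used to cut the PCR products (NotI GCGGCCGC, HindIII AAGCTT, PstI CTGCAG). The names `PRIMERS`, `SITES` and `sites_in` are hypothetical, and the sequences are copied from the table with spaces removed.

```python
# Table 11 primers as transcribed in this text, with the stated length (nt).
PRIMERS = {
    "DS299": ("CGTAGCGGCCGCCTGAAAGGGGTTGACCGTCCGTCGGC", 38),
    "DS300": ("CGTAAAGCTTCGCCGCCCGACAGGGTAACATATTATCAC", 40),
    "DS301": ("CGTAAAGCTTGACCACGTAGTAATAATAAGTGCATGGC", 38),
    "DS302": ("CGTACTGCAGATTGGATAGTGATTAGAGTGTATAGTCCCGG", 41),
    "DS303": ("GGAGCGACAAACCTTTCG", 18),
    "DS304": ("ACCGTAATAAAAGATGGCTG", 20),
    "DS305": ("CATCTTGTGTGTGAGTATGGTCGG", 24),
    "DS306": ("CCCAGGATAATTTTCAGG", 14),
}

# Recognition sequences of the enzymes used to cut the PCR products.
SITES = {"NotI": "GCGGCCGC", "HindIII": "AAGCTT", "PstI": "CTGCAG"}


def sites_in(seq: str) -> list[str]:
    """Names of the enzymes whose recognition sequence occurs in seq."""
    return [name for name, site in SITES.items() if site in seq]


mismatches = []
for name, (seq, stated) in PRIMERS.items():
    found = sites_in(seq)
    flag = "" if len(seq) == stated else "  <-- length differs from table"
    if flag:
        mismatches.append(name)
    print(f"{name}: {len(seq)} nt (stated {stated}), sites: {found or 'none'}{flag}")
```

As transcribed here, DS300 counts to 39 nt and DS306 to 18 nt, which differ from the stated 40mer and 14mer; this may reflect characters lost or misread in transcription rather than an error in the primers themselves.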

Primers DS299 and DS300 amplified the 5' region of PDI1 by PCR, while primers
DS301 and DS302 amplified a region 3' of PDI1, using genomic DNA derived from
S288c as a template. The PCR conditions were as follows: 1μL S288c template
DNA (at 0.01ng/μL, 0.1ng/μL, 1ng/μL, 10ng/μL and 100ng/μL), 5μL 10XBuffer
(Fast Start Taq+Mg, (Roche)), 1μL 10mM dNTP's, 5μL each primer (2μM),
0.4μL Fast Start Taq, made up to 50μL with H2O. PCRs were performed using a
Perkin-Elmer Thermal Cycler 9700. The conditions were: denature at 95°C for
4min [HOLD], then [CYCLE] denature at 95°C for 30 seconds, anneal at 45°C for
30 seconds, extend at 72°C for 45 seconds for 20 cycles, then [HOLD] 72°C for
10min and then [HOLD] 4°C. The 0.22kbp PDI1 5' PCR product was cut with
NotI and HindIII, while the 0.34kbp PDI1 3' PCR product was cut with HindIII
and PstI.
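
The reaction composition above can be tabulated to find the volume of water needed and the final concentrations of primers and dNTPs. The following is a minimal sketch, assuming that the stated 2μM and 10mM values are stock concentrations and that two primers are added per reaction; the variable and function names are illustrative only.

```python
REACTION_VOLUME_UL = 50.0

# Component volumes (µL) for one reaction, as listed in the text.
components_ul = {
    "template DNA": 1.0,
    "10X buffer (Fast Start Taq + Mg)": 5.0,
    "10 mM dNTPs": 1.0,
    "primer 1 (2 µM)": 5.0,
    "primer 2 (2 µM)": 5.0,
    "Fast Start Taq": 0.4,
}

# Water makes the reaction up to the final volume.
water_ul = REACTION_VOLUME_UL - sum(components_ul.values())


def final_conc(stock: float, volume_ul: float,
               total_ul: float = REACTION_VOLUME_UL) -> float:
    """Concentration after dilution into the reaction (same units as stock)."""
    return stock * volume_ul / total_ul


primer_final_um = final_conc(2.0, 5.0)   # µM, per primer
dntp_final_mm = final_conc(10.0, 1.0)    # mM, on the same basis as the stock

# Template mass per reaction for the dilution series (1 µL of each).
template_ng = [conc * 1.0 for conc in (0.01, 0.1, 1, 10, 100)]

print(f"water: {water_ul:.1f} µL")
print(f"primer: {primer_final_um:.2f} µM each, dNTPs: {dntp_final_mm:.2f} mM")
print(f"template per reaction (ng): {template_ng}")
```

The same composition is used for the other Fast Start Taq reactions in this Example, so the figures carry over to them.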

Plasmid pMCS5 (Hoheisel, 1994, Biotechniques 17, 456-460) (Figure 85) was
digested to completion with HindIII, blunt ended with T4 DNA polymerase plus
dNTPs and religated to create pDB2964 (Figure 86).

Plasmid pDB2964 was HindIII digested, treated with calf intestinal phosphatase,
and ligated with the 0.22kbp PDI1 5' PCR product digested with NotI and HindIII
and the 0.34kbp PDI1 3' PCR product digested with HindIII and PstI to create
pDB3069 (Figure 87) which was sequenced with forward and reverse universal
primers and the DNA sequencing primers DS303, DS304, DS305 and DS306
(Table 11).

Primers DS234 and DS235 (Table 12) were used to amplify the modified TRP1
marker gene from YIplac204 (Gietz & Sugino, 1988, Gene, 74, 527-534),
incorporating HindIII restriction sites at either end of the PCR product. The PCR
conditions were as follows: 1μL template YIplac204 (at 0.01ng/μL, 0.1ng/μL,
1ng/μL, 10ng/μL and 100ng/μL), 5μL 10XBuffer (Fast Start Taq+Mg, (Roche)),
1μL 10mM dNTP's, 5μL each primer (2μM), 0.4μL Fast Start Taq, made up to
50μL with H2O. PCRs were performed using a Perkin-Elmer Thermal Cycler
9600. The conditions were: denature at 95°C for 4min [HOLD], then [CYCLE]
denature at 95°C for 30 seconds, anneal for 45 seconds at 45°C, extend at 72°C
for 90sec for 20 cycles, then [HOLD] 72°C for 10min and then [HOLD] 4°C. The
0.86kbp PCR product was digested with HindIII and cloned into the HindIII site
of pMCS5 to create pDB2778 (Figure 88). Restriction enzyme digestions and
sequencing with universal forward and reverse primers as well as DS236, DS237,
DS238 and DS239 (Table 12) confirmed that the sequence of the modified TRP1
gene was correct.
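
The thermal cycling programme for the TRP1 marker amplification can be written down as data, which makes the total programmed time easy to compute. This is a sketch only: the `Step` class and `programmed_seconds` function are hypothetical names, and the total excludes ramp times between temperatures and the final 4°C hold, neither of which is specified in the text.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    label: str
    temp_c: float
    seconds: int


# Programme for the TRP1 marker amplification, as described in the text.
initial_hold = Step("initial denaturation", 95, 4 * 60)
cycle = [
    Step("denature", 95, 30),
    Step("anneal", 45, 45),
    Step("extend", 72, 90),
]
n_cycles = 20
final_hold = Step("final extension", 72, 10 * 60)


def programmed_seconds(initial: Step, cycle: list[Step],
                       n_cycles: int, final: Step) -> int:
    """Sum of all programmed hold times, ignoring ramping."""
    per_cycle = sum(step.seconds for step in cycle)
    return initial.seconds + n_cycles * per_cycle + final.seconds


total_s = programmed_seconds(initial_hold, cycle, n_cycles, final_hold)
print(f"{total_s} s = {total_s / 60:.0f} min, excluding ramp times and the 4°C hold")
```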

Table 12: Oligonucleotide primers

Primer   Description   Sequence
DS230    TRP1 5' UTR   5'-TAGCGAATTCAATCAGTAAAAATGAACGG-3'
DS231    TRP1 5' UTR   5'-GTCAAAGCTTCAAAAAAAGAAAAGCTCCGG-3'
DS232    TRP1 3' UTR   5'-TAGCGGATCCGAATTCGGCGGTTGTTTGCAAGACCGAG-3'
DS233    TRP1 3' UTR   5'-GTCAAAGCTTTAAAGATAATGCTAAATCATTTGG-3'
DS234    TRP1          5'-TGACAAGCTTTCGGTCGAAAAAAGAAAAGGAGAGG-3'
DS235    TRP1          5'-TGACAAGCTTGATCTTTTATGCTTGCTTTTC-3'
DS236    TRP1          5'-AATAGTTCAGGCACTCCG-3'
DS237    TRP1          5'-TGGAAGGCAAGAGAGCC-3'
DS238    TRP1          5'-TAAAATGTAAGCTCTCGG-3'
DS239    TRP1          5'-CCAACCAAGTATTTCGG-3'
CED005   ΔTRP1         5'-GAGCTGACAGGGAAATGGTC-3'
CED006   ΔTRP1         5'-TACGAGGATACGGAGAGAGG-3'

The 0.86kbp TRP1 gene was isolated from pDB2778 by digestion with HindIII
and cloned into the HindIII site of pDB3069 to create pDB3078 (Figure 89) and
pDB3079 (Figure 90). A 1.41kb pdi1::TRP1 disrupting DNA fragment was
isolated from pDB3078 or pDB3079 by digestion with NotI/PstI.

Yeast strains incorporating a TRP1 deletion (trp1Δ) were to be constructed in such
a way that no homology to the TRP1 marker gene (pDB2778) should be left in the
genome once the trp1Δ had been created, so preventing homologous
recombination between future TRP1 containing constructs and the TRP1 locus. In
order to achieve the total removal of the native TRP1 sequence from the genome
of the chosen host strains, oligonucleotides were designed to amplify areas of the
5' UTR and 3' UTR of the TRP1 gene outside of the TRP1 marker gene present on
integrating vector YIplac204 (Gietz & Sugino, 1988, Gene, 74, 527-534). The
YIplac204 TRP1 marker gene differs from the native/chromosomal TRP1 gene in
that internal HindIII, PstI and XbaI sites were removed by site directed
mutagenesis (Gietz & Sugino, 1988, Gene, 74, 527-534). The YIplac204
modified TRP1 marker gene was constructed from a 1.453kbp blunt-ended
genomic EcoRI fragment, which contained the TRP1 gene and only
102bp of the TRP1 promoter (Gietz & Sugino, 1988, Gene, 74, 527-534).
Although this was a relatively short promoter sequence it was clearly sufficient to
complement trp1 auxotrophic mutations (Gietz & Sugino, 1988, Gene, 74,
527-534). Only DNA sequences upstream of the EcoRI site, positioned 102bp 5' to
the start of the TRP1 ORF, were used to create the 5' TRP1 UTR. The selection of
the 3' UTR was less critical as long as it was outside the 3' end of the functional
modified TRP1 marker, which was chosen to be 85bp downstream of the
translation stop codon.

Single stranded oligonucleotide DNA primers were designed and constructed to
amplify the 5' UTR and 3' UTR regions of the TRP1 gene so that during the PCR
amplification restriction enzyme sites would be added to the ends of the PCR
products to be used in later cloning steps. Primers DS230 and DS231 (Table 12)
amplified the 5' region of TRP1 by PCR, while primers DS232 and DS233 (Table
12) amplified a region 3' of TRP1, using S288c genomic DNA as a template. The
PCR conditions were as follows: 1μL template S288c genomic DNA (at
0.01ng/μL, 0.1ng/μL, 1ng/μL, 10ng/μL and 100ng/μL), 5μL 10XBuffer (Fast
Start Taq+Mg, (Roche)), 1μL 10mM dNTP's, 5μL each primer (2μM), 0.4μL Fast
Start Taq, made up to 50μL with H2O. PCRs were performed using a
Perkin-Elmer Thermal Cycler 9600. The conditions were: denature at 95°C for 4min
[HOLD], then [CYCLE] denature at 95°C for 30 seconds, anneal for 45 seconds at
45°C, extend at 72°C for 90sec for 20 cycles, then [HOLD] 72°C for 10min and
then [HOLD] 4°C.

The 0.19kbp TRP1 5' UTR PCR product was cut with EcoRI and HindIII, while
the 0.2kbp TRP1 3' UTR PCR product was cut with BamHI and HindIII and
ligated into pAYE505 linearised with BamHI/EcoRI to create plasmid pDB2777
(Figure 91). The construction of pAYE505 is described in WO 95/33833. DNA
sequencing using forward and reverse primers, designed to prime from the
plasmid backbone and sequence the cloned inserts, confirmed that in both cases
the cloned 5' and 3' UTR sequences of the TRP1 gene had the expected DNA
sequence. Plasmid pDB2777 contained a TRP1 disrupting fragment that
comprised a fusion of sequences derived from the 5' and 3' UTRs of TRP1. This
0.383kbp TRP1 disrupting fragment was excised from pDB2777 by complete
digestion with EcoRI.

Yeast strain DXY1 (Kerry-Williams et al., 1998, Yeast, 14, 161-169) was
transformed to leucine prototrophy with the albumin expression plasmid pDB2244
using a modified lithium acetate method (Sigma yeast transformation kit,
YEAST-1, protocol 2; (Ito et al., 1983, J. Bacteriol., 153, 163; Elble, 1992,
Biotechniques, 13, 18)) to create yeast strain DXY1 [pDB2244]. The construction
of the albumin expression plasmid pDB2244 is described in WO 00/44772.
Transformants were selected on BMMD-agar plates, and were subsequently
patched out on BMMD-agar plates. Cryopreserved trehalose stocks were prepared
from 10mL BMMD shake flask cultures (24 hrs, 30°C, 200rpm).

DXY1 [pDB2244] was transformed to tryptophan auxotrophy with the 0.383kbp
EcoRI TRP1 disrupting DNA fragment from pDB2777 using a nutrient agar
incorporating the counter selective tryptophan analogue, 5-fluoroanthranilic acid
(5-FAA), as described by Toyn et al., (2000 Yeast 16, 553-560). Colonies
resistant to the toxic effects of 5-FAA were picked and streaked onto a second
round of 5-FAA plates to confirm that they really were resistant to 5-FAA and to
select away from any background growth. Those colonies which grew were then
re-patched onto BMMD and BMMD plus tryptophan to identify which were
tryptophan auxotrophs.

Subsequently colonies that had been shown to be tryptophan auxotrophs were
selected for further analysis by transformation with YCplac22 (Gietz & Sugino,
1988, Gene, 74, 527-534) to ascertain which isolates were trp1.

PCR amplification across the TRP1 locus was used to confirm that the trp-
phenotype was due to a deletion in this region. Genomic DNA was prepared from
isolates identified as resistant to 5-FAA and unable to grow on minimal media
without the addition of tryptophan. PCR amplification of the genomic TRP1 locus
with primers CED005 and CED006 (Table 12) was achieved as follows: 1μL
template genomic DNA, 5μL 10XBuffer (Fast Start Taq+Mg, (Roche)), 1μL
10mM dNTP's, 5μL each primer (2μM), 0.4μL Fast Start Taq, made up to 50μL
with H2O. PCRs were performed using a Perkin-Elmer Thermal Cycler 9600.
The conditions were: denature at 94°C for 10min [HOLD], then [CYCLE]
denature at 94°C for 30 seconds, anneal for 30 seconds at 55°C, extend at 72°C
for 120sec for 40 cycles, then [HOLD] 72°C for 10min and then [HOLD] 4°C.
PCR amplification of the wild type TRP1 locus resulted in a PCR product of
1.34kbp in size, whereas amplification across the deleted TRP1 region resulted in
a PCR product 0.84kbp smaller at 0.50kbp. PCR analysis identified a DXY1
derived trp- strain (DXY1 trp1Δ [pDB2244]) as having the expected deletion
event.
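
The diagnostic logic of this PCR can be expressed as a simple size comparison. The sketch below is illustrative only: the function name `call_genotype` and the 0.1kbp tolerance are assumptions introduced here, not values from the original work; only the two expected product sizes come from the text.

```python
# Expected product sizes (kbp) with primers CED005 and CED006, from the text.
EXPECTED_KBP = {
    "TRP1 (wild type)": 1.34,
    "trp1Δ (deletion)": 0.50,
}


def call_genotype(band_kbp: float, tolerance_kbp: float = 0.1) -> str:
    """Match an observed band to an expected product size.

    tolerance_kbp is a hypothetical allowance for gel sizing error.
    """
    for genotype, expected in EXPECTED_KBP.items():
        if abs(band_kbp - expected) <= tolerance_kbp:
            return genotype
    return "unexpected product size"


size_shift = round(
    EXPECTED_KBP["TRP1 (wild type)"] - EXPECTED_KBP["trp1Δ (deletion)"], 2
)

print(f"size shift on deletion: {size_shift} kbp")
for band in (1.3, 0.5, 0.9):
    print(f"{band} kbp band -> {call_genotype(band)}")
```

The computed shift agrees with the 0.84kbp difference stated above.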

The yeast strain DXY1 trp1Δ [pDB2244] was cured of the expression plasmid
pDB2244 as described by Sleep et al., (1991, Bio/Technology, 9, 183-187).
DXY1 trp1Δ cir° was re-transformed to leucine prototrophy with either
pDB2244, pDB2976, pDB2977, pDB2978, pDB2979, pDB2980 or pDB2981
using a modified lithium acetate method (Sigma yeast transformation kit,
YEAST-1, protocol 2; (Ito et al., 1983, J. Bacteriol., 153, 163; Elble, 1992,
Biotechniques, 13, 18)). Transformants were selected on BMMD-agar plates
supplemented with tryptophan, and were subsequently patched out on BMMD-agar
plates supplemented with tryptophan. Cryopreserved trehalose stocks were prepared
from 10mL BMMD shake flask cultures supplemented with tryptophan (24 hrs,
30°C, 200rpm).

The yeast strains DXY1 trp1Δ [pDB2976], DXY1 trp1Δ [pDB2977], DXY1
trp1Δ [pDB2978], DXY1 trp1Δ [pDB2979], DXY1 trp1Δ [pDB2980] and DXY1
trp1Δ [pDB2981] were transformed to tryptophan prototrophy using the modified
lithium acetate method (Sigma yeast transformation kit, YEAST-1, protocol 2;
(Ito et al., 1983, J. Bacteriol., 153, 163; Elble, 1992, Biotechniques, 13, 18)) with a
1.41kb pdi1::TRP1 disrupting DNA fragment isolated from pDB3078 by
digestion with NotI/PstI. Transformants were selected on BMMD-agar plates and
were subsequently patched out on BMMD-agar plates.

Six transformants of each strain were inoculated into 10mL YEPD in 50mL shake
flasks and incubated in an orbital shaker at 30°C, 200rpm for 4-days. Culture
supernatants and cell biomass were harvested. Genomic DNA was prepared (Lee,
1992, Biotechniques, 12, 677) from the tryptophan prototrophs and DXY1
[pDB2244]. PCR amplification of the genomic PDI1 locus with primers DS236
and DS303 (Tables 11 and 12) was achieved as follows: 1μL template genomic
DNA, 5μL 10XBuffer (Fast Start Taq+Mg, (Roche)), 1μL 10mM dNTP's, 5μL
each primer (2μM), 0.4μL Fast Start Taq, made up to 50μL with H2O. PCRs were
performed using a Perkin-Elmer Thermal Cycler 9700. The conditions were:
denature at 94°C for 4min [HOLD], then [CYCLE] denature at 94°C for 30
seconds, anneal for 30 seconds at 50°C, extend at 72°C for 60sec for 30 cycles,
then [HOLD] 72°C for 10min and then [HOLD] 4°C. PCR amplification of the
wild type PDI1 locus resulted in no PCR product, whereas amplification across
the deleted PDI1 region resulted in a PCR product of 0.65kbp. PCR analysis
identified that all 36 potential pdi1::TRP1 strains tested had the expected
pdi1::TRP1 deletion.

The recombinant albumin titres were compared by rocket immunoelectrophoresis
(Figure 92). Within each group, all six pdi1::TRP1 disruptants of DXY1 trp1Δ
[pDB2976], DXY1 trp1Δ [pDB2978], DXY1 trp1Δ [pDB2980], DXY1 trp1Δ
[pDB2977] and DXY1 trp1Δ [pDB2979] had very similar rHA productivities.
Only the six pdi1::TRP1 disruptants of DXY1 trp1Δ [pDB2981] showed variation
in rHA expression titre. The six pdi1::TRP1 disruptants indicated in Figure 92
were spread onto YEPD agar to isolate single colonies and then re-patched onto
BMMD agar.

Three single celled isolates of DXY1 trp1Δ pdi1::TRP1 [pDB2976], DXY1 trp1Δ
pdi1::TRP1 [pDB2978], DXY1 trp1Δ pdi1::TRP1 [pDB2980], DXY1 trp1Δ
pdi1::TRP1 [pDB2977], DXY1 trp1Δ pdi1::TRP1 [pDB2979] and DXY1 trp1Δ
pdi1::TRP1 [pDB2981] along with DXY1 [pDB2244], DXY1 [pDB2976], DXY1
[pDB2978], DXY1 [pDB2980], DXY1 [pDB2977], DXY1 [pDB2979] and DXY1
[pDB2981] were inoculated into 10mL YEPD in 50mL shake flasks and incubated
in an orbital shaker at 30°C, 200rpm for 4-days. Culture supernatants were
harvested and the recombinant albumin titres were compared by rocket
immunoelectrophoresis (Figure 93). The thirteen wild type PDI1 and pdi1::TRP1
disruptants indicated in Figure 93 were spread onto YEPD agar to isolate single
colonies. One hundred single celled colonies from each strain were then
re-patched onto BMMD agar or YEPD agar containing a goat anti-HSA antibody to
detect expression of recombinant albumin (Sleep et al., 1991, Bio/Technology, 9,
183-187) and the Leu+/rHA+, Leu+/rHA-, Leu-/rHA+ or Leu-/rHA- phenotype of
each colony scored (Table 13).



Table 13:

           PDI1                          pdi1::TRP1
           Leu+   Leu-   Leu+   Leu-     Leu+   Leu-   Leu+   Leu-
Plasmid    rHA+   rHA+   rHA-   rHA-     rHA+   rHA+   rHA-   rHA-
pDB2244    100    0      0      0        -      -      -      -
pDB2976    7      0      47     46       97     0      3      0
pDB2978    86     0      0      14       100    0      0      0
pDB2980    98     0      0      2        100    0      0      0
pDB2977    0      0      4      96       100    0      0      0
pDB2979    69     0      6      25       100    0      0      0
pDB2981    85     0      0      15       92     0      0      8

These data indicate plasmid retention is increased when the PDI1 gene is used as a
selectable marker on a plasmid in a host strain having no chromosomally encoded
PDI, even in a non-selective medium such as the exemplified rich medium.
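
The comparison behind this conclusion can be tallied directly from Table 13. The sketch below treats the Leu+ phenotype as a proxy for plasmid retention, on the basis that the plasmids were introduced by transformation to leucine prototrophy; that reading, and the names `TABLE_13` and `leu_plus_percent`, are assumptions made for illustration.

```python
# Colonies per phenotype (out of 100 scored), from Table 13.
# Column order: (Leu+ rHA+, Leu- rHA+, Leu+ rHA-, Leu- rHA-)
TABLE_13 = {
    "pDB2244": {"PDI1": (100, 0, 0, 0)},
    "pDB2976": {"PDI1": (7, 0, 47, 46), "pdi1::TRP1": (97, 0, 3, 0)},
    "pDB2978": {"PDI1": (86, 0, 0, 14), "pdi1::TRP1": (100, 0, 0, 0)},
    "pDB2980": {"PDI1": (98, 0, 0, 2), "pdi1::TRP1": (100, 0, 0, 0)},
    "pDB2977": {"PDI1": (0, 0, 4, 96), "pdi1::TRP1": (100, 0, 0, 0)},
    "pDB2979": {"PDI1": (69, 0, 6, 25), "pdi1::TRP1": (100, 0, 0, 0)},
    "pDB2981": {"PDI1": (85, 0, 0, 15), "pdi1::TRP1": (92, 0, 0, 8)},
}


def leu_plus_percent(counts: tuple[int, int, int, int]) -> float:
    """Percentage of scored colonies that are Leu+ (with or without rHA)."""
    leu_plus_rha_plus, _, leu_plus_rha_minus, _ = counts
    return 100 * (leu_plus_rha_plus + leu_plus_rha_minus) / sum(counts)


retention = {
    plasmid: {host: leu_plus_percent(counts) for host, counts in hosts.items()}
    for plasmid, hosts in TABLE_13.items()
}

for plasmid, by_host in retention.items():
    summary = ", ".join(f"{host}: {pct:.0f}% Leu+" for host, pct in by_host.items())
    print(f"{plasmid}: {summary}")
```

On this reading, every plasmid scored in both backgrounds shows a Leu+ percentage in the pdi1::TRP1 host that is equal to or higher than in the PDI1 host.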